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connection, Riemannian curvature and related topics. 

I tried to make the presentation as light as possible: gave more details in 
smooth regular situations and referred to the literature in more complicated 
cases. There is an evidence that the results described in the notes and treated 
in technical papers we refer to are just parts of a united beautiful subject to be 
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and Optimal Control Theory. I will be happy if the course and the notes 
encourage some young ambitious researchers to take part in the discovery 
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Part I 

Lagrange multipliers' geometry 

1 Smooth optimal control problems 

In these lectures we discuss some geometric constructions and results emerged 
from the investigation of smooth optimal control problems. We'll consider 
problems with integral costs and fixed endpoints. A standard formulation of 
such a problem is as follows: Minimize a functional 

ti 

Jg(«(-)) = / p(q(t),u(t))dt, (1) 
to 

where 

q(t) = f(q(t), u(t)), u(t) EU, Vt G [t , h], (2) 

q(to) = Qo, q(ti) — Qi- Here q(t) G M n , U C M. k , a control function u(-) is 
supposed to be measurable bounded while q(-) is Lipschitzian; scalar function 
<p and vector function / are smooth. A pair («(•), (?(•)) is called an admissible 
pair if it satisfies differential equation (2) but may violate the boundary 
conditions. 

We usually assume that Optimal Control Theory generalizes classical Cal- 
culus of Variations. Unfortunately, even the most classical geometric varia- 
tional problem, the length minimization on a Riemannian manifold, cannot 
be presented in the just described way. First of all, even simplest mani- 
folds, like spheres, are not domains in M n . This does not look as a serious 
difficulty: we slightly generalize original formulation of the optimal control 
problem assuming that q(t) belongs to a smooth manifold M instead of W 1 . 
Then q{t) is a tangent vector to M i.e. q(t) G T q ^M and we assume that 
f(q,u) G T q M, Wq,u. Manifold M is called the state space of the optimal 
control problem. 



3 



Now we'll try to give a natural formulation of the length minimization 
problem as an optimal control problem on a Riemannian manifold M. Rie- 
mannian structure on M is (by definition) a family of Euclidean scalar prod- 
ucts (•, -) q on T q M, q G M, smoothly depending on q. Let fi(q), ■ ■ ■ , f n {o) 
be an orthonormal basis of T q M for the Euclidean structure (•, -) q selected in 
such a way that fi(q) are smooth with respect to q. Then any Lipschitzian 
curve on M satisfies a differential equation of the form: 

n 

? = l>(0/i(9), (3) 

where Mj(-) are measurable bounded scalar functions. In other words, any 
Lipschitzian curve on M is an admissible trajectory of the control system 

n / n \ V 2 

(3). The Riemannian length of the tangent vector ^Uifi(q) is I Yl u l J 

i=l \i=l J 

Hence the length of a trajectory of system (3) defined on the segment [to,ti] 
t ( n Y /2 

is £(u(-)) = J t ^ I ^2uj(t) J dt. Moreover, it is easy to derive from the 
Cauchy-Schwarz inequality that the length minimization is equivalent to 

. n 

the minimization of the functional J^(u(-)) = J2u?(t)dt. The length 

i=i 

minimization problem is thus reduced to a specific optimal control problem 
on the manifold of the form (1), (2). 

Unfortunately, what I've just written was wrong. It would be correct if 
we could select a smooth orthonormal frame fi(q), q G M, i = l,...,n. 
Of course, we can always do it locally, in a coordinate neighborhood of M 
but, in general, we cannot do it globally. We cannot do it even on the 2- 
dimensional sphere: you know very well that any continuous vector field on 
the 2-dimensional sphere vanishes somewhere. We thus need another more 
flexible formulation of a smooth optimal control problem. 

Recall that a smooth locally trivial bundle over M is a submersion 
7i : V — > M, where all fibers V q = ft~ l {q) are diffeomorphic to each other 
and, moreover, any q G M possesses a neighborhood O q and a diffeomor- 
phism $4 : O q x V q -> n~ l (O q ) such that $ q (q',V q ) = V q >, Vg' G O q . In 
a less formal language one can say that a smooth locally trivial bundle is a 
smooth family of diffeomorphic manifolds V q (the fibers) parametrized by the 
points of the manifold M (the base). Typical example is the tangent bundle 
TM = (J T q M with the canonical projection ir sending T q M into q. 
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Definition. A smooth control system with the state space M is a smooth 
mapping / : V — ■> TM, where V is a locally trivial bundle over M and 
f(V q ) C T g M for any fiber V q , q G M. An admissible pair is a bounded 1 
measurable mapping v(-) : [to,ti\ — > V such that i i— > 7r(t>(t)) = g(t) is a 

Lipschitzian curve in M and <j(t) = f(v(t)) for almost all t G [to,ti]- Integral 

ti 

cost is a functional J* 1 (f(-)) = / <p(v(t))dt, where (p is a smooth scalar 

to 

function on V. 

Remark. The above more narrow definition of an optimal control problem 
on M was related to the case of a trivial bundle V = M x U, V q = {q} x U. 
For the length minimization problem we have V = TM, f = Id, tp(v) = 
(v,v) q , VveT q M, qeM. 

Of course, any general smooth control system on the manifold M is locally 
equivalent to a standard control system on IR n . Indeed, any point q G M 
possesses a coordinate neighborhood O q diffeomorphic to M. n and a mapping 
$ g : O q x V q — > 7r -1 (O g ) trivializing the restriction of the bundle V to O g ; 
moreover, the fiber V q can be embedded in IR fc and thus serve as a set of 
control parameters U. 

Yes, working locally we do not obtain new systems with respect of those 
in W 1 . Nevertheless, general intrinsic definition is very useful and instructive 
even for a purely local geometric analysis. Indeed, we do not need to fix 
specific coordinates on M and a trivialization of V when we study a control 
system defined in the intrinsic way. A change of coordinates in M is actually 
a smooth transformation of the state space while a change of the trivialization 
results in the feedback transformation of the control system. This means that 
an intrinsically defined control system represents actually the whole class 
of systems that are equivalent with respect to smooth state and feedback 
transformations. All information on the system obtained in the intrinsic 
language is automatically invariant with respect to smooth state and feedback 
transformations. And this is what any geometric analysis intends to do: to 
study properties of the object under consideration preserved by the natural 
transformation group. 

We denote by L oo ([t , ti]; V) the space of measurable bounded mappings 
from [t ,ti\ to V equipped with the Loo-topology of the uniform convergence 
on a full measure subset of [t , ti]. If V would an Euclidean space, then 

1 the term 'bounded' means that the closure of the image of the mapping is compact 
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Loo([to,ti]',V) would have a structure of a Banach space. Since V is only 
a smooth manifold, then L oo ([i , ti]; V) possesses a natural structure of a 
smooth Banach manifold modeled on the Banach space Loo ([to, ti}; IR dimy ). 

Assume that V — > M is a locally trivial bundle with the n-dimensional 
base and m-dimensional fibers; then V is an (n + m)-dimensional manifold. 

Proposition 1.1 Let f : V — > TM &e a smooth control system; then the 
space V 0/ admissible pairs of this system is a smooth Banach submanifold 
o/L oo ([t ,ti];\/) modeled onR n x Loo([t , ti]; K m ). 

Proof. Let v(-) be an admissible pair and q(t) = n(v(t)), t e [to,ti]. 
There exists a Lipschitzian with respect to t family of local trivializations 
Rt '■ O q (t) x U — > 7r -1 (O g (t)), where [/ is diffeomorphic to the fibers V^. The 
construction of such a family is a boring exercise which we omit. 
Consider the system 

q = foR t (q,u), u E U. (4) 

Let v(t) = R t (q(t),u(t)); then R t , t < t < t 1; induces a diffeomorphism of 
an Loo-neighborhood of (q(-),u(-)) in the space of admissible pairs for (4) on 
a neighborhood of v(-) in V. Now fix t G [to, For any q close enough to 
q(t) and any u'(-) sufficiently close to u(-) in the Loo-topology there exists 
a unique Lipschitzian path q'(-) such that q'(t) — f o R t (q'(t),u'(t))), t < 
t < ti, q'(i) = q; moreover the mapping (g, «'(■)) 1— > g'(-) is smooth. In other 
words, the Cartesian product of a neighborhood of q(t) in M and a neighbor- 
hood of u(-) in Lqo ([to, ti], C/) serves as a coordinate chart for a neighborhood 
of v (•) in V. This finishes the proof since M is an n-dimensional manifold 
and Loo([to,ti], U) is a Banach manifold modeled on L oo ([t , ti], IR m ). □ 

An important role in our study will be played by the "evaluation map- 
pings" F t : v(-) 1— > g(t) = 7r(f (£)). It is easy to show that F t is a smooth 
mapping from V to M. Moreover, it follows from the proof of Proposition 1.1 
that Ft is a submersion. Indeed, q(t) = F t (v(-)) is, in fact a part of the 
coordinates of v(-) built in the proof (the remaining part of the coordinates 
is the control u(-). 

2 Lagrange multipliers 

Smooth optimal control problem is a special case of the general smooth con- 
ditional minimum problem on a Banach manifold W. The general problem 
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consists of the minimization of a smooth functional J : W — > R on the 
level sets of a smooth mapping $ : W — > iV, where iV is a finite- 

dimensional manifold. In the optimal control problem we have W = V, iV = 
MxM, $=(F to ,F tl ). 

An efficient classical way to study the conditional minimum problem is 
the Lagrange multipliers rule. Let us give a coordinate free description of 
this rule. Consider the mapping 

$ = (J,$):W^lxiV, $(w) = (J(w), $H), w G W. 

It is easy to see that any point of the local conditional minimum or maxi- 
mum (i.e. local minimum or maximum of J on a level set of $) is a crit- 
ical point of $. I recall that w is a critical point of $ if the differential 
D W Q : T W W — > T$r w \ (R x iV) is not a surjective mapping. Indeed, if D W Q 
would surjective then, according to the implicit function theorem, the image 
$(O w ) of an arbitrary neighborhood O w of w would contain a neighborhood 
of $(u>) = (J(w),$(w)); in particular, this image would contain an interval 
((J(w) — e, J(w) + e), <&(«;)) that contradicts the local conditional minimal- 
ity or maximality of J(w). 

The linear mapping D W Q is not surjective if and only if there exists a 
nonzero linear form £ on T"$( w ) (R x iV) which annihilates the image of D w &. 
In other words, £D W <& = 0, where £D W Q : T W W — > R is the composition of 
D w $ and the linear form £ : T^ {w) (R x JV) -»• R. 

We have T #(w) (R x N) = R x T^ w) N. Linear forms on (R x TV) con- 
stitute the adjoint space (R x N)* = R © T*, w) N, where T*, w) N is the ad- 
joint space of T$( W )M (the cotangent space to M at the point <E>(w)). Hence 
£ = z/ © ^ where f el, £ G T* (u)) 7V and 

= (z/ © £) (d w J, D w &) = vd w J + £D W <&. 
We obtain the equation 

vd w J + £D w <5> = 0. (5) 

This is the Lagrange multipliers rule: if w is a local conditional extremum, 
then there exists a nontrivial pair (u, £) such that equation (5) is satisfied. 
The pair (v, £) is never unique: indeed, if a is a nonzero real number, then 
the pair (au,a£) is also nontrivial and satisfies equation (5). So the pair is 
actually defined up to a scalar multiplier; it is natural to treat this pair as 

an element of the projective space P (wt © T^ w ^N^j rather than an element 

of the linear space. 
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The pair [y, £) which satisfies (5) is called the Lagrange multiplier asso- 
ciated to the critical point w. The Lagrange multiplier is called normal if 
v 7^ and abnormal if v — 0. In these lectures we consider only normal 
Lagrange multipliers, they belong to a distinguished coordinate chart of the 
projective space P (m © T* (w) N^j . 

Any normal Lagrange multiplier has a unique representative of the form 
(— 1,1); then (5) is reduced to the equation 

£D W <1> = d w J. (6) 

The vector £ G T£, W ^N from equation (6) is also called a normal Lagrange 
multiplier (along with {—!,£)). 



3 Extremals 

Now we apply the Lagrange multipliers rule to the optimal control problem. 
We have $ = (F to ,F tl ) :V^MxM. Let an admissible pair v G V be a 
critical point of the mapping (J^ 1 , <E>), the curve q(t) = n(v(t)), t < t < t\ 
be the corresponding trajectory, and i E ^( q (t ) q(ti))(^ x ^) be a normal 
Lagrange multiplier associated to v(-). Then 

eD v (F t0 ,F tl )=d v J%. (7) 

We have TT q{to) q{ti)) (M x M) = T* {to) M x T* {ti) M, hence £ can be presented 
in the form £ = (— A io , A tl ), where X ti G T*^M, i — 0,1. Equation (7) takes 
the form 

\ L D v F tl — X to D v F to = d v J^. (8) 

Note that X tl in (8) is uniquely defined by X to and v. Indeed, assume that 
X' tl D v F tl -X t0 D v F t0 = d v JH for some G T* {ti) M. Then (X' tl —X tl )D v F tl = 
0. Recall that F tl is a submersion, hence D v F tl is a surjective linear map 
and X' tl — A tl =0. 

Proposition 1.2 Equality (8) implies that for any t G [io,*i] ^ere exzsfe a 
unique X t G T*^M such that 

X t D v F t - X t0 D v F t0 = d v Jl o (9) 
and X t is Lipschitzian with respect to t. 
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Proof. The uniqueness of X t follows from the fact that F t is a submersion 
as it was explained few lines above. Let us proof the existence. To do that 
we use the coordinatization of V introduced in the proof of Proposition 1.1, 
in particular, the family of local trivializations R t : O q ( t ) x U — > n~ 1 (O q ( t ))- 
Assume that v(t) = R t (q(t),u(t)), t < t < t\, where v(-) is the referenced 
admissible pair from (8). 

Given r G [t , ti], q G O q i T \ let t \— > Q^iq) be the solution of the differ- 
ential equation q = R t (q,u(t)) which satisfies the condition Q r T {q) = q. In 
particular, Q^qij)) = q(t). Then Q\ is a diffeomorphism of a neighborhood 
of q(r) on a neighborhood of q(i). We define a Banach submanifold V T of 
the Banach manifold V in the following way: 

V T = {v' G V : 7r(u'(t)) = (7r(u'(r))), r < t < h}. 
It is easy to see that F tl = Q* 1 o F T and J* 1 = a T o F T , where 



vv 



Vr 



Vr 



a(q) = J (p ($ t ((5*(g),u(0)) ^- On the other hand, the set {f ' G V : w'|[t , T ] G 

T 

V T |j to r j} is a neighborhood of v in V. The restriction of (8) to V T gives: 

A, (Qt 1 o ^r) - Xt D v F t0 = d v J T to + d v (a T o F T ) . 
Now we apply the chain rule for the differentiation and obtain: 

\ T D V F T — X to D v F to = d v Jt o , 
where A T = \ tl D q{j )Q^ - d q ( T )a T . □ 

Definition. A Lipschitzian curve t i— > A t , to < ^ < t±, is called a normal 
extremal of the given optimal control problem if there exists an admissible 
pair v G V such that equality (9) holds. The projection q(t) = 7r(A t ) of a 
normal extremal is called a (normal) extremal path or a (normal) extremal 
trajectory. 

According to Proposition 1.2, normal Lagrange multipliers are just points 
of normal extremals. A good thing about normal extremals is that they 
satisfy a nice differential equation which links optimal control theory with a 
beautiful and powerful mathematics and, in many cases, allows to explicitly 
characterize all extremal paths. 
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4 Hamiltonian system 



Here we derive equations which characterize normal extremals; we start from 
coordinate calculations. Given r G [£o,£i], fix a coordinate neighborhood O 
in M centered at q(r), and focus on the piece of the extremal path q(-) which 
contains q(-) and is completely contained in O. Identity (9) can be rewritten 
in the form 

\ t D v F t - \ T D V F T = d v J l T , (10) 

where q(t) belongs to the piece of q(-) under consideration. Fixing coordi- 
nates and a local trivialization of V we (locally) identify our optimal con- 
trol problem with a problem (1), (2) in R n . We have T*R n = R n x ]R n = 
{(p,q) : p,q E K n }, where T*R n = R n x {q}. Then A t = {p(t),q(t)} and 
\ t D v F t - = (p(t),D v F t -) = D v (p(t),F t ). 

Admissible pairs of (2) are parametrized by q = F T (v'), v' G V, and con- 
trol functions «'(')> the pairs have the form: v' = (u'(-), ?'(•; q, u' (•))), where 
§- t q'{t\ q, u'(-)) = f (q'(t; q, u' (■)) , u' (t)) for all available t and q'(r; q, u(-)) = q. 
Then F t (v') = q'(t; q,u'(-)). 

Now we differentiate identity (10) with respect to t: -^D v (p(t), F t ) = 
^d v J* and change the order of the differentiation D v -^(p(t), F t ) = d v ^J\. 
We compute the derivatives with respect to t at t — r: 

^(p(t),F t )\ t=r = (p(r),q) + (p(r),f(q,u'(r)), = <p(q, u'{r)). 

Now we have to differentiate with respect to v'(-) = (u'(-), ?'(•))■ We however 
see that the quantities to differentiate depend only on the values of u'(-) and 
q'(-) at r, i.e. on the finite-dimensional vector {u'{r),q). We derive: 

m + §- q (p(r)J(Q(r),u(r))) = 

^(p(r),f(q(r)Mr))) = ^(q(r),u(r)), 

where v(-) = (q(-),u(-)). 

Of course, we can change r and perform the differentiation at any avail- 
able moment t. Finally, we obtain that (10) is equivalent to the identities 

Pit) + ^ «p(f), f(q(t),u(t))) - <p(q(t),u(t))) = 0, 
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— «p(t), f(q(t),u(t))) - <f(q(t),u(t))) = 0, 



which can be completed by the equation q = f (q(t) , u(t)) . We introduce a 
function h(p, q, u) = (p, f(q, u)) — (p(q, u) which is called the Hamiltonian of 
the optimal control problem (1), (2). This function permits us to present the 
obtained relations in a nice Hamiltonian form: 

( . dh 

* , ^ 9 ,«)=o. (id 

A more important fact is that system (11) has an intrinsic coordinate 
free interpretation. Recall that in the triple (p, q, u) neither p nor u has an 
intrinsic meaning; the pair (p, q) represents A G T*M while the pair (q,u) 
represents v G V. First we consider an intermediate case V = M x U (when 
u is separated from q but coordinates in M are not fixed) and then turn to 
the completely intrinsic setting. 

If V = M x U, then f : M x U — > TM and f(q,u) G T ? M. The 
Hamiltonian of the optimal control problem is a function h : T*M x U — > R 
defined by the formula /i(A, w) = A(/(g, u))-ip(q, u), VA G T*M, q <E M, u e 

U. For any u E U we obtain a function /i u /i(-, -u) on T*M. The cotangent 
bundle T*M possesses a canonical symplectic structure which provides a 
standard way to associate a Hamiltonian vector field to any smooth function 
on T*M. We'll recall this procedure. 

Let 7T : T*M -> M be the projection, vr(T*M) = {g}. The Liouville 
(or tautological) differential 1-form ^ on T*M is defined as follows. Let 
<; A : T X (T*M) -> R be the value of ^ at A G T*M, then ^ A = Aovr*, the compo- 
sition of 7T* : T\(T*M) -> T w(A) M and the cotangent vector A : T^ (A) M -> R. 
The coordinate presentation of the Liouville form is: <r( Pi? ) = (p, rfg) = 

n 

p l dq\ where p — (p 1 , . . . ,p n ), g = (g 1 , . . . , q n ). The canonical symplectic 

i=i 

structure on T*M is the differential 2-form cr = d^; its coordinate represen- 

n 

tation is: a = o?p l A dq l . The Hamiltonian vector field associated to a 
i=i 

smooth function a : T*M — * R is a unique vector field a on T*M which sat- 
isfies the equation a(-,a) = da. The coordinate representation of this field 
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n r \ 

is: a = — J^^j- Equations (11) can be rewritten in the form: 

i=i ^ ' 

dh 

A = h u (X), — (A,«) = 0. (12) 

Now let V be an arbitrary locally trivial bundle over M. Consider the Carte- 
sian product of two bundles: 

T*M x M V = {(A, v) : v G V q , A G T*M, g G M} 

that is a bundle over M whose fibers are Cartesian products of the correspon- 
dent fibers of V and T*M. Hamiltonian of the optimal control problem takes 
the form h(\,v) = X(f(v)) — ip(v); this is a well-defined smooth function on 
T*M x M U. Let p : T*M x M V -> T*M be the projection on the first factor, 
p : (X,v) I— > A. Equations (11) (or (12)) can be rewritten in the completely 
intrinsic form as follows: (pV)„(-, A) = dh. One may check this fact in any 
coordinates; we leave this simple calculation to the reader. 

Of course, by fixing a local trivialization of V, we turn the last relation 
back into a more convinient to study equation (12). A domain V in T*M is 
called regular for the Hamiltonian h if for any A G T> there exists a unique 
solution u = u(X) of the equation §^(A, u) = 0, where u(X) is smooth with 
respect to A. In particular, if U is an afline space and the functions u \— > 
h(X, u) are strongly concave (convex) and bounded from above (below) for 
A G V, then V is regular and u(X) is defined by the relation 

h(X, u(X)) = max h(X, u) I h(X, u(X)) = min h(X, u) 

In the regular domain, we set H{X) = h(X,u(X)), where |^(A,m(A)) = 0. It 
is easy to see that equations (12) are equivalent to one Hamiltonian system 
A = H(X). Indeed, the equality d {x ,u(\))h = d x h n (\) + ^r~du = d x h U (\) 
immediately implies that H(X) = hu(\)(X). 



5 Second order information 

We come back to the general setting of Section 2 and try to go beyond the 
Lagrange multipliers rule. Take a pair (£, w) which satisfies equation (6). We 
call such pairs (normal) Lagrangian points. Let — z. If w is a regular 
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point of $, then $ _1 (,2) D O w is a smooth codimension dim submanifold 
of W, for some neighborhood O w of w. In this case w is a critical point of 
^\0- 1 (z)no ' ^ e are § om S t° compute the Hessian of at w without 

resolving the constraints <&(w) = z. The formula we obtain makes sense 
without the regularity assumptions as well. 

Let s t— > 7(s) be a smooth curve in such that 7(0) = w. Differen- 

tiation of the identity $(7(5)) = z gives: 

D w ^ = 0, D£$(7, 7) + A^7 = 0, 

where 7 and 7 are the first and the second derivatives of 7 at s = 0. We also 
have: 

^J( 7 ( S ))| s =o = D 2 w J(j, 7) + D^Jty eq i 6) 
^^(7,7) +^$7 = D 2 w J(j,j) - ^$(7,7)- 

Finally, 

Hess.(j| $ _ 1(2) ) = (DlJ-£D 2 w <S>)\ keiD ^. (13) 

Proposition 1.3 If quadratic form (13) is positive (negative) definite, then 
w is a strict local minimizer (maximizer) °/^| 4 -i(- z y 

If w is a regular point of $, then the proposition is obvious but one can 
check that it remains valid without the regularity assumption. On the other 
hand, without the regularity assumption, local minimality does not imply 
nonnegativity of form (13). What local minimality (maximality) certainly 
implies is nonnegativity (nonpositivity) of form (13) on a finite codimension 
subspace of ker D w § (see jTJ Ch. 20] and references there). 

Definition. A Lagrangian point (£,w) is called sharp if quadratic form (13) 
is nonnegative or nonpositive on a finite codimension subspace of ker D w &. 

Only sharp Lagrangian points are counted in the conditional extremal 
problems under consideration. Let Q be a real quadratic form defined on a 
linear space E. Recall that the negative inertia index (or the Morse index) 
indQ is the maximal possible dimension of a subspace in E such that the 
restriction of Q to the subspace is a negative form. The positive inertia 
index of Q is the Morse index of —Q. Each of these indices is a nonnegative 
integer or +00. A Lagrangian point (£, w) is sharp if the negative or positive 
inertia index of form (13) is finite. 
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In the optimal control problems, W is a huge infinite dimensional manifold 
while N usually has a modest dimension. It is much simpler to characterize 
Lagrange multipliers in T*N (see the previous section) than to work directly 
with J\$,-i( z y Fortunately, the information on the sign and, more generally, 
on the inertia indices of the infinite dimensional quadratic form (13) can also 
be extracted from the Lagrange multipliers or, more precisely, from the so 
called C-derivative that can be treated as a dual to the form (13) object. 

^-derivative concerns the linearization of equation (6) at a given La- 
grangian point. In order to linearize the equation we have to present its left- 
and right-hand sides as smooth mappings of some manifolds. No problem 
with the right-hand side: w i— > d w J is a smooth mapping from W to T*W. 
The variables (£, w) of the left-hand side live in the manifold 

$*T*iV = {(£,w) : £ G T* {w) , w G W} C T*N x W. 

Note that Q*T*N is a locally trivial bundle over W with the projector 7r : 
(£,w) i — y w; this is nothing else but the induced bundle from T*N by the 
mapping $. We treat equation (6) as the equality of values of two mappings 
from &*T*N to T*W. Let us rewrite this equation in local coordinates. 

So let N = R m and W be a Banach space. Then T*N = R m * x R m 
(where T Z N = R m * x {z}), T*W = W* x W, $*T*iV = R m * x R m x W. 
Surely, R m * = R m but in the forthcoming calculations it is convenient to 
treat the first factor in the product R m * x R m as the space of linear forms on 
the second factor. We have: £ = (£, z) G R m * x IR m and equation (6) takes 
the form 

d$ dJ 

c *; = ^ Mm) = z - (14) 

Linearization of system (14) at the point (£,z,w) reads: 
d 2 § . , . d 2 J . , . d$ , , 

c ^ + c ^ (w '- ) = ^ (ro '- ) - ^= 2 ' (15) 

We set 

C° {ttW) ($) = {£' = (C',z') G T e (T*N) : 3w' G W s.t. (C',^,w') satisfies (15)}. 

Note that subspace £^ U) - ) ( ( I) C T e (T*N) does not depend on the choice of 
local coordinates. Indeed, to construct this subspace we take all (£', w') G 
T(e, w )(Q*T* N) which satisfy the linearized equation (6) and then apply the 
projection (£ f ,w f ) i— > £'. 
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Recall that T^(T*N) is a symplectic space endowed with the canonical 
symplectic form an (cf. Sec. 4). A subspace S C T^(T*N) is isotropicii ag\s = 
0. Isotropic subspaces of maximal possible dimension m = ^ dimTi(T*N) 
are called Lagrangian subspaces. 

Proposition 1.4 C% W J&) is an isotropic subspace of Ti>(T* N) . if dim W < 
oo ; then C® lw ^(Jb) is a Lagrangian subspace. 

Proof. First we'll prove the isotropy of C° {e w) ($). Let (£', z'), ((", z") G 
T e (T*N). We have a e (((', z'), ((", z")) = ('z" - ("z'; here the symbol (z 
denotes the result of the application of the linear form ( G M m * to the vector 
z G M. n or, in the matrix terminology, the product of the row ( and the 
column z. Assume that ((',z',w') and ((",z",w") satisfy equations (15); 
then 

Cz =c d^ w =^ (w ' w) -W w ' w) - (16) 

The right-hand side of (16) is symmetric with respect to w' and w" due 
to the symmetry of second derivatives. Hence ('z" = ("z'. In other words, 
a e (((',z'),((",z")) = 0. So £^ w -j($) is isotropic and, in particular, 

dim(4 !tt #)) <m. 

Now show that the last inequality becomes the equality as soon as W is 
finite dimensional. Set Q = ^ — and consider the diagram: 

,,d® nf I x left (r , ,s right ( , d$ A 

Then C9g w \(®) = right(ker(left)). Passing to a factor space if necessary we 
may assume that ker(/e/t) D kei(right) = 0; this means that: 

^—w 1 & Q(w',-) = Q w' = 0. (17) 

aw 

Under this assumption, dim jC^ w ^(^) = dimker(/e/t). On the other hand, 
relations (17) imply that the mapping left : lR m * x W — > W* is surjective. 
Indeed, if, on the contrary, the map left is not surjective then there exists 
a nonzero vector v G (W*)* = W which annihilates the image of Ze/t; in 
other words, = 0, VC',tu'. Hence = & Q(u, •) = that 

contradicts (17)!" It follows that dim £j <j1u) ($) = dim(M m * x W) - dimW* = 
m. □ 
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For infinite dimensional W, the space may have dimension 

smaller than m due to an ill-posedness of equations (15); to guarantee di- 
mension m one needs certain coercivity of the form C^~fr- 1 am n °t going 
to discuss here what kind of coercivity is sufficient, it can be easily recon- 
structed from the proof of Proposition 1.4 (see also 0). Anyway, indepen- 
dently on any coercivity one can take a finite dimensional approximation of 
the original problem and obtain a Lagrangian subspace £% w \($) guaranteed 
by Proposition 1.4. What happens with these subspaces when the approxi- 
mation becomes better and better, do they have a well-defined limit (which 
would be unavoidably Lagrangian)? A remarkable fact is that such a limit 
does exist for any sharp Lagrangian point. It contains and is called 

the /^-derivative of $ at (£, w) . To formulate this result we need some basic 
terminology from set theoretic topology. 

A partially ordered set (21, -<) is a directed set if Vai, a2 G 21 3(3 G 21 such 
that a\ -< (3 and a 2 -< (3. A family {x Q }ae2i of points of a topological space X 
indexed by the elements of 21 is a generalized sequence in X. A point x G X 
is the limit of the generalized sequence {x Q }ae2i if f° r an Y neighborhood O x of 
x in X 3a G 21 such that x@ G C^, V/3 >- a; in this case we write x = limx a . 

Let to be a finite dimensional submanifold of W and w G tt). If (•£, if) is 
a Lagrangian point for $ = (J, $), then it is a Lagrangian point for A 
straightforward calculation shows that the Lagrangian subspace £% w \{$\to) 
depends on the tangent space W = T w tv rather than on ro, i.e. C°^ w ^\ m ) = 

($lw) as soon as r «itt> = T^ro' = W. We denote A w = £^ w) ($| ro ). 
Recall that Ajy is an m-dimensional subspace of the 2m-dimensional space 
Ti(T*N), i.e. A\y is a point of the Grassmann manifold of all m-dimensional 
subspaces in Ti(T*N). 

Finally, we denote by 2U the set of all finite dimensional subspaces of 
T W W partially ordered by the inclusion "c". Obviously, (2U, c) is a directed 
set and {A w } WeW is a generalized sequence indexed by the elements of this 
directed set. It is easy to check that there exists W G 2U such that A w D 
C% w \(&), WW D Wq. In particular, if£9^ ism-dimensional, then A Wo = 
C% W J&), WW D Wo, the sequence A w is stabilizing and C® lw ^(Jh) = limA^y- 

In general, the sequence A\y is not stabilizing, nevertheless the following 
important result is valid. 

Theorem 1. 1 If (£,w) is a sharp Lagrangian point, then there exists 
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We omit the proof of the theorem, you can find this proof in paper [H] with 
some other results which allow to efficiently compute limA^. Lagrangian 

subspace C(t,w){®) — limA^ is called the C-derivative of $ = ( J, $) at the 
Lagrangian point (£, w). 

Obviously, £^ jt0 )($) D One should think on C^,w)(^) as on a 

completion of C^^Q) by means of a kind of weak solutions to system (15) 
which could be missed due to the ill-posedness of the system. 

Now we should explain the connection between £(^ jU ,)($) and 
Hess m (j| $ _ 1 , J. We start from the following simple observation: 

Lemma 1.1 Assume that dimW < oo, w is a regular point of $ and 
kerD w $ n ker(£>2 j _ W 2 W <$>) = 0. Then 

kerHes Su ,(j|^ 1(z) ) = & C(e, w )($) D T E {T* Z N) = 0, 

i.e. quadratic form Hess w ( J\^-i^) is nondegenerate if and only if the sub- 
space Cy )W -)($) is transversal to the fiber T*N. 

Proof. We make computations in coordinates. First, Ti(T*N) = {(C, 0) : 
(' G IR n *}; then, according to equations (15), ((', 0) G if and only if 

there exists w G W such that 

aw aw z dw A aw 

Regularity of w implies that ('^ ^ and hence w' ^ as soon as (' ^ 0. 
Equalities (18) imply: g«v) - £0(w» = 0, Vt> G ker g, i.e. w' G 
ker Hess w ( J| $ _ x , J. Moreover, our implications are invertible: we could start 

from a nonzero vector w' G ker Hess M ,( ^l^-i^p and arrive to a nonzero vector 
(C',0) €£(*,„,)($). □ 

Remark. Condition kerD w $ D ker(D^,J — ^D^$) = from Lemma 1.1 is 
not heavy. Indeed, a pair (J, $) satisfies this condition at all its Lagrangian 
points if and only if is a regular value of the mapping (£, w) i— > £g — jp. 
Standard Transversality Theorem implies that this is true for generic pair 
(J,*). 
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6 Maslov index 



Lemma 1.1 is a starting point for a far going theory which allows to effectively 
compute the Morse index of the Hessians in terms of the /^-derivatives. 

How to do it? Normally, extremal problems depend on some parameters. 
Actually, z G iV is such a parameter and there could be other ones, which 
we do not explicitly add to the constraints. In the optimal control problems 
a natural parameter is the time interval t\ — to- Anyway, assume that we 
have a continuous family of the problems and their sharp Lagrangian points: 
£ T D Wr $ T = d Wr J T , t < r < n; let A(r) = C(e T ,w T )(®r)- Our goal is to 
compute the difference ind Hess Wn (J T1 | $ -i/ 2 J — ind Hess WT() ( J TU \^-i, z J in 

terms of the family of Lagrangian subspaces A(r); that is to get a tool to 
follow the evolution of the Morse index under a continuous change of the 
parameters. This is indeed very useful since for some special values of the 
parameters the index could be known a'priori. It concerns, in particular, 
optimal control problems with the parameter r = t± — t . If ti — to is very 
small then sharpness of the Lagrangian point almost automatically implies 
the positivity or negativity of the Hessian. 

First we discuss the finite-dimensional case: Theorem LI indicates that 
finite-dimensional approximations may already contain all essential infor- 
mation. Let Q T be a continuous family of quadratic forms defined on a 
finite-dimensional vector space. If kerQ T = 0, r < r < t±, then indQ T is 
constant on the segment [to,Ti]. This is why Lemma 1.1 opens the way to 
follow evolution of the index in terms of the /^-derivative: it locates values 
of the parameter where the index may change. Actually, /^-derivative allows 
to evaluate this change as well; the increment of indQ T is computed via so 
called Maslov index of a family of Lagrangian subspaces. In order to define 
this index we have to recall some elementary facts about symplectic spaces. 

Let E, a be a symplectic space, i.e. E is a 2n-dimensional vector space 
and a be a nondegenerate anti-symmetric bilinear form on E. The skew- 
orthogonal complement to the subspace F C E is the subspace F z = {x e 
E : a(x,F) = 0}. The nondegeneracy of a implies that dimF z = 2n — dimF. 
A subspace T is isotropic if and only if T z D T; it is Lagrangian if and only 
if T z = F. 

Let n = span{ei, . . . , e n } be a lagrangian subspace of E. Then there exist 
vectors fi, ■ ■ ■ , f n £ E such that a{e^ fj) = 5ij, where 5ij is the Kronecker 
symbol. We show this using induction with respect to n. Skew-orthogonal 
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complement to the space span{ei, . . . , e„_i} contains an element / which is 
not skew-orthogonal to e n ; we set f n = , 1 „ f. We have 

span{e n , /„} n span{e n , f n } z = 

and the restriction of a to span{e n , f n } z is a nondegenerate bilinear form. 
Hence span{e n , f n } z is a 2(n — l)-dimensional symplectic space with a La- 
grangian subspace span{ei, . . . , e n _i}. According to the induction assump- 
tion, there exist fi, ■ ■ ■ , f n -i such that o{e^ fj) = 5ij and we are done. 

Vectors ei, . . . , e n , /i, . . . , /„ form a basis of E; in particular, 
A = span{fi, . . . , f n } is a transversal to II Lagrangian subspace, E = II© A. 

If = tiCiej + zifj), i = 1,2, andCi = (£,---,C), * = *?) T , 

then cr(xi, rc 2 ) = — C2^i- The coordinates (, z identify E with R n * x ]R n ; 
any transversal to A n-dimensional subspace A C S has the following pre- 
sentation in these coordinates: 

A = {z T ,S A z) : z G W 1 }, 

where S\ is an n x n-matrix. The subspace A is Lagrangian if and only if 
S* A = S A . We have: 

Ann = {(-2 T ,0) : z G ker^}, 

the subspace A is transversal to II if and only if S\ is nondegenerate. 

That's time to introduce some notations. Let L(E) be the set of all 
Lagrangian subspaces, a closed subset of the Grassmannian G n (E) of n- 
dimensional subspaces in E. We set 

A^ = {A G L(E) : An A = 0}, 

an open subset of L(E). The mapping A i— > S A gives a regular parametriza- 
tion of A^ 1 by the n(n + l)/2-dimensional space of symmetric n x n-matrices. 
Moreover, above calculations show that L(E) = \J A*. Hence L(E) is a 

AeL(E) 

n(n+ l)/2-dimensional submanifold of the Grassmannian G n (E) covered by 
coordinate charts A*. The manifold L(E) is called Lagrange Grassmannian 
associated to the symplectic space E. It is not hard to show that any coordi- 
nate chart A* is everywhere dense in L(E); our calculations give also a local 
parametrization of its complement. 
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Given II G L(E), the subset 

M u = L(Z) \ IT* = {A G L(E) : A n n + 0} 

is called the train of II. Let Ao G A4n, dim(Ao fl IT) = k. Assume that 
A is transversal to both Ao and IT (i.e. A G Aj n IT*). The mapping 
A i— > S\ gives a regular parametrization of the neighborhood of Ao in Ain by 
a neighborhood of a corank k matrix in the set of all degenerate symmetric 
n x n-matrices. A basic perturbation theory for symmetric matrices now 
implies that a small enough neighborhood of Ao in .Mn is diffeomorphic 
to the Cartesian product of a neighborhood of the origin of the cone of all 
degenerate symmetric k x k- matrices and a (n(n+l) — /c(/c+l))/2-dimensional 
smooth manifold (see P3 Lemma 2.2] for details). We see that .Mn is not a 
smooth submanifold of L(E) but a union of smooth strata, .Mn = U > 

fe>0 

where M$ = {A G L(E) : dim(A n IT) = £;} is a smooth submanifold of 
L(S) of codimension k(k + l)/2. 

Let A(r), r G [t 0? ^i] be a smooth family of Lagrangian subspaces (a 
smooth curve in L(E)) and A(io),A(ii) G IT*. We are going to define the 
intersection number of A(-) and .Mn- It is called the Maslov index and is 
denoted /in(A(-)). Crucial property of this index is its homotopy invariance: 
given a homotopy A s (-), s G [t ,ti] such that A s (t ), A s (h) G IT* Vs G [0, 1], 
we have M A °(-)) = / u n (A 1 (-)). 

It is actually enough to define /xn(A(-)) for the curves which have empty 
intersection with M. u \ M.^ , the desired index would have a well-defined 
extension to other curves by continuity. Indeed, generic curves have empty 
intersection with .Mn \ -A^n anc ^' moreover, generic homotopy has empty 
intersection with .Mn \ Mn' since any of submanifolds , k = 2,...n 
has codimension greater or equal to 3 in L(E). Putting any curve in gen- 
eral position by a small perturbation, we obtain the curve which bypasses 
M n \ M { ^\ and the invariance with respect to generic homotopies of the 
Maslov index defined for generic curves would imply that the value of the 
index does not depend on the choice of a small perturbation. 

What remains is to fix a "coorientation" of the smooth hypersurface M.^ 
in X/(E), i. e. to indicate the "positive and negative sides" of the hypersurface. 
As soon as we have a coorientation, we may compute //n(A(-)) for any curve 
A(-) which is transversal to Ai^ and has empty intersection with .Mn\-Mn ■ 
Maslov index of A(-) is just the number of points where A(-) intersects Ai^ in 
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the positive direction minus the number of points where this curve intersects 
•Mjj m the negative direction. Maslov index of any curve with endpoints 
out of .Mn is defined by putting the curve in general position. Proof of the 
homotopy invariance is the same as for usual intersection number of a curve 
with a closed cooriented hypersurface (see, for instance, the nice elementary 
book by J. Milnor "Topology from the differential viewpoint", 1965). 

The coorientation is a byproduct of the following important structure on 
the tangent spaces to L(E). It happens that any tangent vector to L(H) at 
the point A G L(H) can be naturally identified with a quadratic form on A. 
Her we use the fact that A is not just a point in the Grassmannian but an n- 
dimensional linear space. To associate a quadratic form on A to the velocity 
A(t) G T A ^L(E) of a smooth curve A(-) we proceed as follows: given x G A(t) 
we take a smooth curve r ^ x(t) in E in such a way that x(r) G A(r), W 
and x(t) = x. Then we define a quadratic form A(t)(x), x G A(t), by the 
formula A(t)(x) = a(x,x(t)). 

The point is that a(x, x(t)) does not depend on the freedom in the choice 
of the curve r i— > x(t), although x(t) depends on this choice. Let us check 
the required property in the coordinates. We have x = (z T , S A ^z) for some 
zeR n and x(r) = (z(t) t , S a(t) z(t)). Then 

a(x, x(t)) = z T (S A{t )Z + S A{t) z) - z T S A{t) z = z T S A(t) z; 

vector z does not show up. We have obtained a coordinate presentation of 
A(t): 

A(t)(z T , S m z) = z T S m z, 

which implies that A i— > A, A G T A L(Y) is an isomorphism of T A L(T.) on the 
linear space of quadratic forms on A. 

We are now ready to define the coorientation of -M^. Assume that 
A(t) G i- e. A(t) nil = ~SLx for some nonzero vector x G S. In 

coordinates, x = (^ T ,0), where M.x = kerS'A(t). It is easy to see that A(t) 
is transversal to M.^ (i. e. S A / t ) is transversal to the cone of degenerate 
symmetric matrices) if and only if A(t)(x) ^ (i. e. z 1 S A (t)Z ^ 0). Vector 
x is defined up to a scalar multiplier and A(t)(ax) = a 2 A(t)(x) so that the 
sign of A(t)(x) does not depend on the selection of x. 

Definition. We say that A(-) intersects M.^ at the point A(t) in the positive 
(negative) direction if A(t)(x) > (< 0). 
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This definition completes the construction of the Maslov index. A weak 
point of the construction is the necessity to put the curve in general posi- 
tion in order to compute the intersection number. This does not look as an 
efficient way to thinks since putting the curve in general position is nothing 
else but a deliberate spoiling of a maybe nice and symmetric original object 
that makes even more involved the nontrivial problem of the localization of 
its intersection with Aiu- Fortunately, just the fact that Maslov index is 
homotopy invariant leads to a very simple and effective way of its computa- 
tion without putting things in general position and without looking for the 
intersection points with Aiu- 

Lemma 1.2 Assume that U A = A(r) fl A = 0, Vr G [t , *i] - Then 
/xn(A(-)) = ind^A^o)— ind^A^i), whereindS is the Morse index of the quadratic 
form z T Sz, z G W 1 . 

Proof. The matrices SA(t ) an d <SA(t ) are nondegenerate since A(£ ) H IT = 
A(ti) nil = (we define the Maslov index only for the curves whose endpoins 
are out of Aiu)- The set of nondegenerate quadratic forms with a prescribed 
value of the Morse index is a connected open subset of the linear space 
of all quadratic forms in n variables. Hence homotopy invariance of the 
Maslov index implies that /xn(A(-)) depends only on ind<SA(t ) an d ind<SA(ti)- 
It remains to compute /in of sample curves in A*, say, for segments of the 
curve A(-) such that 

(r-l o ... o \ 
r-2 ... \ 
6 6 ... T-nJ 

In general, given curve is not contained in the fixed coordinate neighbor- 
hood A* but any curve can be divided into segments A(-)|r r . jr i, i = 0, . . . , /, 
in such a way that A(r) G A* Vr G [tj, Tj+i], where Aj G IT*, i — 0, . . . , I; 
then /in(A(-)) = E^n (A(-)| [r . )T . +l] ) . 

i 

Lemma 1.2 implies the following useful formula which is valid for the 
important class of monotone increasing curves in the Lagrange Grassman- 
nian, i.e. the curves A(-) such that A(t) are nonnegative quadratic forms: 
A(t) > 0, V*. 

Corollary 1.1 Assume that A(r) > 0, Vr G [*o,*i] and {r G [*o,*i] : A(r) n 
n^0} is a finite subset of (t ,ti). Then 

MA(-))= dim(A(r)nn). □ 

re(*o,*i) 
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Corollary 1.1 can be also applied to the case of monotone decreasing curves 
defined by the inequality A(i) < 0, V£; the change of parameter t ^ t + ti—t 
makes the curve monotone increasing and and change sign of the Maslov 
index. 

Let me now recall that our interest to these symplectic playthings was 
motivated by the conditional minimum problems. As it was mentioned at 
the beginning of the section, we are going to apply this stuff to the case 
E = T £t (T*M), i T G TIM, IT = T £t (T; t M), A(r) = C (Et , Wt t ), where 
z T = <S> T (w T ). In this case, not only A but also II and even symplectic space 
S depend on r. We thus have to define Maslov index in such situation. This 
is easy. We consider the bundle 

{^,r)^ET iT (T*M), to<r<t!} (19) 

over the segment [t ,ti] induced from T(T*M) by the mapping r i— > £ T . 
Bundle (19) endowed with the symplectic structure and its subbundle 

{^r):ieT lT {TlM)} 

are trivial as any bundle over a segment. More precisely, let t G [£o,£i], 
£i = Tg t (T*M), Il t = T it (T* t M); then there exists a continuous with respect 
to t family of linear symplectic mappings E T : T £t (T*M) — > E t such that 
^t{T(, t (T* t M)) = U t , to < t < ti, S t = Id. To any continuous family of 
Lagrangian subspaces A(r) C Tg T (T*M), where A(tj) fl U ti = 0, i — 0, 1, we 
associate a curve S.A(-) : r i— > S r A(r) in the Lagrange Grassmannian L(E t ) 

def 

and set /i(A(-)) = /xn t (S A(-)). Homotopy invariance of the Maslov index 
implies that /in t (2.A(-)) does not depend on the choice of t and S T . 

Theorem 1.2 Assume that dim W < oo ; 

$ r = ( J T> | T ):W^KxM, r G [i , *i] 

a continuous one-parametric family of smooth mappings and (£ T ,w T ) is 
a continuous family of their Lagrangian points such that £ T ^ 0, w T is a 
regular point o/$ r , and ker D Wt $ t Piker (£)^ r J T — £ T Dl, T $ T ) = ; £ < t < h. 
Let z T = $(w T ), A(r) = £ (£ti1Ut) ($ t ). //Hess Wt .(J t .| s _i ), i = 1,2, are 

nondegenerate, then 

indHess Wto (Jt | s _i (jS(o) ) - ind Hess^ ( J tl | $ -i (2ti) ) = M A ('))- 
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Remark. If £ T = 0, then w T is a critical point of J T (without restriction to 
the level set of $ r ). Theorem 1.2 can be extended to this situation (with the 
same proof) if we additionally assume that kerHess^ r J T = for any r such 
that i T = 0. 

Proof. We introduce simplified notations: A T = D Wt $ t , Q t = D <2 u]t .] t — 
Et-D^^t-; the ^-derivative £(£ tjU , t )($ t ) = A(r) is uniquely determined by the 
linear map A T and the symmetric bilinear form Q T . Fix local coordinates in 
the neighborhoods of w T and z T and set: 

A(A, Q) = {(C, Av) : (A + Q(v, •) = 0} G L(R n * x R n ); 

then A r = A(A T , Q T ). 

The assumption ker A T fl ker Q T = implies the smoothness of the map- 
ping (A,Q) t— > A(v4,Q) for (A, Q) close enough to (A T ,Q T ). Indeed, as it 
is shown in the proof of Proposition 1.4, this assumption implies that the 
mapping left T : ((, v) \— > (A T + Q T (v, •) is surjective. Hence the kernel of the 
mapping 

(CtO-C^ + QOv) (20) 

smoothly depends on (A, Q) for (A, Q) close to (A T , Q T ). On the other hand, 
A(A, Q) is the image of the mapping (£, t>) (C, Av) restricted to the kernel 
of map (20). 

Now we have to disclose a secret which the attentive reader already knows 
and is perhaps indignant with our lightness: Q T is not a well-defined bilinear 
form on T Wt W, it essentially depends on the choice of local coordinates in 
M. What are well-defined is the mapping QT| kerA : ker A T T* t W (check 
this by yourself or see Subsec. 2.3]), the map A T : T Wt W — > T Zt M and, of 
course, the Lagrangian subspace A(r) = £« T)toT )($ r ). By the way, the fact 
that <5r| kcrA is well-defined guarantees that assumptions of Theorem 1.2 do 
not depend on the coordinates choice. 

Recall that any local coordinates {z} on M induce coordinates {{(,z) : 
C e R n *,z G W 1 } on T*M and T*M = {(C,0) : ( e M n *} in the induced 
coordinates. 

Lemma 1.3 Given z G M, £ G T~M \ {0}, and a Lagrangian subspace 
A G TeiT-M)™ C L(Tg(T*M)), there exist centered at z local coordinates on 
M such that A = {(0,z):z6 M n } in the induced coordinates on Tg(T*M). 
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Proof. Working in arbitrary local coordinates we have I = (Co,0), A = 
{(Sz,z) : z G IR n }, where S is a symmetric matrix. In other words, A is 
the tangent space at (Co, 0) to the graph of the differential of the function 
a(z) = (o z + \z^ Sz. any smooth function with a nonzero differential can be 
locally made linear by a smooth change of variables. To prove the lemma it 
is enough to make a coordinates change which kills second derivative of the 
function a, for instance: z i— > z + (z T Sz)(q . □ 

We continue the proof of Theorem 1.2. Lemma 1.3 gives us the way to take 
advantage of the fact that Q T depends on the choice of local coordinates in M. 
Indeed, bilinear form Q T is degenerate if and only if A r fl{(0, z) : z G W a } ^ 0; 
this immediately follows from the relation 

A T = {((,A T v):(A T + Q T (v, •) = 0}. 

Given t G [io,^i] take a transversal to T^ t {T* t M) and A(t) Lagrangian sub- 
space A t C Te t (T*M) and centered at z t local coordinates in M such that 
A t = {(0,z) : z G M n } in these coordinates. Then A(r) is transversal to 
{(0, z) : z G M™} for all r from a neighborhood O t of t in [t , ti\. Selecting an 
appropriate finite subcovering from the covering O t , t G [to,ti] of [to,ti\ we 
can construct a subdivision t = r < T\ < . . . < r k < r fc+1 = t\ of [to, ti] with 
the following property: Vi G {0, 1, . . . , k} the segment {z T : r G [r,, T i+1 ]} of 
the curve z T is contained in a coordinate neighborhood O l of M such that 
A T fl {(0,z) : z G M n } = Vr G [ri,Tj+i] in the correspondent local coordi- 
nates. 

We identify the form Q T with its symmetric matrix, i.e. Q T ( v i, v 2) — 
vjQ T v 2 . Then Q T is a nondegenerate symmetric matrix and 

A(r) = {(C, -A-Q^^C 1 "}, n < r < r l+1 . (21) 

Now focus on the subspace A(rj); it has a nontrivial intersection with {(^, 0) : 
C G lR n *} = T eT .(T* T M) if and only if the matrix A^Q'^Aj. is degenerate. 
This is the matrix of the restriction of the nondegenerate quadratic form 
v i— > f 1 "^" 1 ^ to the image of the linear map A^. Hence A Ti Q~^A^ T . can be 
made nondegenerate by the arbitrary small perturbation of the map A Ti : 
T Wt .W — > T Zt M. Such perturbations can be realized simultaneously for 
i = l,...,k 2 by passing to a continuous family r i— > A' T , t < r < t±, 

2 We do not need to perturb A to and A tk : assumption of the theorem and Lemma 1.1 
guarantee the required nondegeneracy property. 
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arbitrary close and homotopic to the family r i— > A T . In fact, A' T can be 
chosen equal to A T out of an arbitrarily small neighborhood of . . . , r^}. 
Putting now A' T instead of A T in the expression for A(r) we obtain a family 
of Lagrangian subspaces A'(r). This family is continuous (see the paragraph 
containing formula (20)) and homotopic to A(-). In particular, it has the 
same Maslov index as A(-). In other words, we can assume without lack of 
generality that Afa) n T Et _ (T* M) = 0, i = 0, 1, . . . , k + 1. Then /x(A(-)) = 

^ fi ^A(-)|^ T +i ]j • Moreover, it follows from (21) and Lemma 1.2 that 

Besides that, indQ Ti = indQ Ti+1 since Q T is nondegenerate for all r G [r^, r i+ i] 
and continuously depends on r. 

Recall that Hess^ ^t|^-i^ — < 5r| kcrA • In order to complete proof of 
the theorem it remains to show that 

indQ T = ind (Q T \ kcrAT ) + indKQ; 1 ^ (22) 

for r = Ti,T i+1 . 

Let us rearrange the second term in the right-hand side of (22). The 
change of variables v = Q~ 1 Ajz, z G W 1 , implies: ind (AtQ^AJ^J = 

ind (<5t| {q -i a t z:26]K „ } ) • We have: Q T (v,kerA T ) = if and only if Q T (v, •) = 

z T A T for some z G M n , i.e. v T Q T = z T A T , v = Q~ l Aj z. Hence the right- 
hand side of (22) takes the form 

indQ T = ind (Q T \ kcrAT ) + ind (Qt\ {v:Qt{vMtAt)=0} ) 

and Qt\^ v .q , v kcrA \ qi is a nondegenerate form for t — Ti, Tj+i. Now equality 

(22) is reduced to the following elementary fact of linear algebra: If Q is a 
nondegenerate quadratic form on IR m and E C W n is a linear subspace, then 
indQ = ind(Q| £ ) + ind (q\ e %) + dim(£ n E%), where E^ = {v G R m : 

£7) = 0} and E n E% = ker (Q| £ ) = ker (<2|^) . □ 

Remark. Maslov index /in is somehow more than just the intersection num- 
ber with A4u- It can be extended, in a rather natural way, to all continuous 
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curves in the Lagrange Grassmannian including those whose endpoint belong 
to A4u- This extension allows to get rid of the annoying nondegeneracy as- 
sumption for Hess mt . (7^| $ -i^ j) in the statement of Theorem 1.2. In general, 

Maslov index computes 1/2 of the difference of the signatures of the Hessians 
which is equal to the difference of the Morse indices in the degenerate case 
(see [3] for this approach). 



7 Regular extremals 

A combination of the finite-dimensional Theorem 1.2 with the limiting proce- 
dure of Theorem 1.1 and with homotopy invariance of the Maslov index allows 
to efficiently compute Morse indices of the Hessians for numerous infinite- 
dimensional problems. Here we restrict ourselves to the simplest case of a 
regular extremal of the optimal control problem. 

We use notations and definitions of Sections 3, 4. Let h(X, u) be the 
Hamiltonian of a smooth optimal control system and Xt, to < t < ti, be an 
extremal contained in the regular domain T> of h. Then X t is a solution of the 
Hamiltonian system A = H(X), where H(X) = h(X,u(X)), |^/i(A,u(A)) = 0. 

Let q(t) = 7i(X t ), t ,< t < ti be the extremal path. Recall that the pair 
(A to ,A 4 ) is a Lagrangian multiplier for the conditional minimum problem 
defined on an open subset of the space 

M X L^MlU) = {(q t )M-)) :qeM,u(-) G £«,([*,,, 17)}, 

where u(-) is control and q± is the value at t of the solution to the differential 
equation q = f(q,u(r)), r G [to,^i]- in particular, F t (q t ,u(-)) = q t . The cost 
is Jll(q t ,u(-)) and constraints are F to (q t ,u(-)) = q(0), q t = q{t). 

Let us set Jt(u) = J^(g(t), «(•))> ®t(u) = F to (q(t),u(-)). A covector 
A G T*M is a Lagrange multiplier for the problem ( J t , $ t ) if and only if 
there exists an extremal A r , t < r < t, such that A to = A, X t G T*^M. 
In particular, X tQ is a Lagrange multiplier for the problem ( J t , $ 4 ) associated 
to the control u(-) = u(X). Moreover, all sufficiently close to A to Lagrange 
multipliers for this problem are values at to of the solutions A(r), t < r <t 
to the Hamiltonian system A = H(X) with the boundary condition X(t) G 

We'll use exponential notations for one-parametric groups of diffeomor- 
phisms generated ordinary differential equations. In particular, e rH :T*M — ► 
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T*M, r 6 K, is a flow generated by the equation A = H(X), so that 
A(r') = e < - r ' _r ^(A(r), r, r' £ R, and Lagrange multipliers for the problem 
(J t ,$ t ) fill the n-dimensional submanifold e(*°~*^ (T*^M^j . 

We set $ t = (J t , $ t ); it is easy to see that the /^-derivative C(\ t u )($t) is 
the tangent space to e^ a (t^M^ , i.e. C (Xto , u) ($ t ) = el*°- t)/? T At (t^m). 

Indeed, let us recall the construction of the /^-derivative. First we linearize 
the equation for Lagrange multipliers at X to . Solutions of the linearized 
equation form an isotropic subspace u )(^t) °f the symplectic space 
T XtQ (T*M). If £° AtQ )M) (l>t) is a Lagrangian subspace (i.e. dim/^^t) = 
dimM), then £( A o = £(a <0 ,«)($*), otherwise we need a limiting proce- 

dure to complete the Lagrangian subspace. In the case under consideration, 
£° At u )($t) = e*°~ t ^Tx t (t*^M^ has a proper dimension and thus coincides 
with 

^(Ai ,u)( < ^t)- We can check independently that el*° t ^T\ t (t*^Mj is La- 
grangian: indeed, T\ t (t*^Mj is Lagrangian and ei*° ^ H : T\ t (T*M) — ► 

T\ tQ (T*M) is an isomorphism of symplectic spaces since Hamiltonian flows 
preserve the symplectic form. 

So t i-> £(A toiM )(5>t) is a smooth curve in the Lagrange Grassmannian 
L [T\ t (T*M)j and we can try to compute Morse index of 

Hess u (•V|#-i (5(to)) ) = Hess « ( J *olF t - 1 ( g (to))n^ 1 (g(ti))) 

via the Maslov index of this curve. Of course, such a computation has no 
sense if the index is infinite. 

Proposition 1.5 (Legendre condition) If quadratic form |^|(A 4 , -u(t)) is neg- 
ative definite for any t G [£o>£i]; then indHess u (j tl |$-i( 9 ( to )) J < 00 anc ^ 
Hess M ^t| $ -i( 9 ( t ))J positive definite for any t sufficiently close to (and 



strictly greater than) to. If |^|(A t , w(t)) ^ for some t E [to,tx], th 



indHess M (j t 4_ 1(?(to)) ) = 00. 



en 



We do not give here the proof of this well-known result; you can find it in 
many sources (see, for instance, the textbook [Jj). It is based on the fact 
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that p(\ t ,u(t)) = A(0(g(t),«(t)))-gf(g(t),u(t)) is the infinitesimal (for 
the "infinitesimally small interval" at t) version of Xt D^ tl — D^J tl while 



Iker D u * tl ' 



Hess u (^i| -i (9(to)) ) = i D l\ - K D l®h] 

Next theorem shows that in the 'regular' infinite dimensional situation 
of this section we may compute the Morse index similarly to the finite di- 
mensional case. The proof of the theorem requires some information about 
second variation of optimal control problems which is out of the scope of 
these notes. The required information can be found in Chapters 20, 21 of 
[7j. Basically, it implies that finite dimensional arguments used in the proof 
of Theorem 1.2 are legal also in our infinite dimensional case. 

We set: A(t) = el'°"' )# T Af (t; (4) m). 

Theorem 1.3 Assume that §^t(At, u(t)) is a negative definite quadratic form 
and u is a regular point of $ t , Vt G (toj^i]- Then: 

• The form Hess u (j^ l^- 1 ^^))) ^ s degenerate if and only if 
A(h) n A(* ) ± 0; 

• If 'A(ti) fl A(£o) = 0, then there exists i > to such that 

indHess u (^U-i ( , (to)) ) = ~V (a(-) | [tM] ) , Vr G (t ,t). □ 

Note that Legendre condition implies monotonicity of the curve A(-); 
this property simplifies the evaluation of the Maslov index. Fix some local 
coordinates in M so that T*M = {(p, q) G lR n * x ]R n }. 



Lemma 1.4 Quadratic form A(t) is equivalent (with respect to a linear change 

du T d 2 h, 
dp du 2 



of variables) to the form -0(A t ) = f n T g(A t , u(\ t ))f. 



Proof. Equality ttt = —it tt^jt is an easy corollary of the identities 

M j Opt yp yp j j 

H(p,q) = h(p,q,u(p,q)), f u \ u=aM = 0. Indeed, = 2 £|g + ffgg 
and #■ (§) = + = 0. Further, we have: 

dp \ du I dpdu du A dp ' 

^A(t) = j/:°' t)ff T Xt feM) = ef-^A _e- S T Xt+e (T; {t+£) M) . 



e=0 
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Set A(e) = e~ eS T Xt+e (r* {t+e) M) E L (T x{t) (T* M)) . It is enough to prove 
that A(0) is equivalent to -0(A t ). Indeed, k(t) = e ( *°~ t)H T Xt A(0), where 

e (t -t)H :Txt (T*M)^T Xto (T*M) 

is a symplectic isomorphism. The association of the quadratic form A(t) on 
the subspace A(t) to the tangent vector A(t) E L (T At) (T*M)) is intrinsic, i.e. 
depends only on the symplectic structure on (T XtQ (T* M) . Hence A(0)(£) = 

A(f) (ei* -*^), V£ E A(0) = T Xt (T; W M). 

What remains, is to compute A(0); we do it in coordinates. We have: 

A(e) - I (£(e) r,(e)) - ^ = ^V.) + ?7 T 0(V.), ^ eRn *\ 
A ^-{^(e)). . (r) = _ d jH {Xt _ r)e _ # = j. 

A(o)(e(o)) = a ((£(o),o), (e(o),i)(o))) = e(o)i)(o) = -e(o)^(A,)e(o) T . 

□ 

Now combining Lemma 1.4 with Theorem 1.3 and Corollary 1.1 we obtain 
the following version of the classical "Morse formula" 

Corollary 1.2 Under conditions of Theorem 1.3, if {r E (to,ti] : A(r) H 
A(t ) 7^ 0} is a finite subset of (to, ti) , then 

indHessJ tl | $ _ 1(9(to)) = £ dim(A(r) n A(f„)). 

re(to,ti) 

Part II 

Geometry of Jacobi curves 

8 Jacobi curves 

Computation of the ^-derivative for regular extremals in the last section 
has led us to the construction of curves in the Lagrange Grassmannians 
which works for all Hamiltonian systems on the cotangent bundles, inde- 
pendently on any optimal control problem. Set A x = T X (T*M), where 
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A G T*M, q e M. The curve r i-> rH A eTl j^ in the Lagrange Grass- 

mannian L (T A (T*M)) is the result of the action of the flow e 4// on the vector 
distribution {A\}\ e r*M- Now we are going to study differential geometry 
of these curves; their geometry will provide us with a canonical connection 
on T*M associated with the Hamiltonian system and with curvature-type 
invariants. All that gives a far going generalization (and a dynamical inter- 
pretation) of classical objects from Riemannian geometry. 

In fact, construction of the basic invariants does not need symplectic 
structure and the Hamiltonian nature of the flow, we may deal with more 
or less arbitrary pairs (vector Geld, rank n distribution) on a 2n-dimensional 
manifold N. The resulting curves belong to the usual Grassmannian of all n- 
dimensional subspaces in the 2n-dimensional one. We plan to work for some 
time in this more general situation and then come back to the symplectic 
framework. 

In these notes we mainly deal with the case of involutive distributions 
(i.e. with n-foliations) just because our main motivation and applications 
satisfy this condition. The reader can easily recover more general definitions 
and construction by himself. 

So we consider a 2n-dimensional smooth manifold N endowed with a 
smooth foliation of rank n. Let z G N, by E z we denote the passing through 
z leaf of the foliation; then E z is an n-dimensional submanifold of N. Point 
z has a coordinate neighborhood O z such that the restriction of the foliation 
to O z is a (trivial) fiber bundle and the fibers E l z ° c , z' G O z , of this fiber 
bundle are connected components of E z < n O z . Moreover, there exists a 
diffeomorphism O z = R n x IR n , where M. n x {y}, y G W 1 , are identified with 
the fibers so that both the typical fiber and the base are diffeomorphic to IR n . 
We denote by O z /E loc the base of this fiber bundle and by tt : O z -> O z /E loc 
the canonical projection. 

Let C be a smooth vector field on N. Then z 1 \— > tt*C( z ')i z ' E l z oc is a 
smooth mapping of E l z oc to T 7r ( 2 )(0 2 / E loc ). We denote the last mapping by 
II z (():E l ° c ^T n{z) (O z /E l ° c ). 

Definition. We call ( a lifting field if Tl z (() is a constant mapping Wz G A^; 
The field ( is called regular if Tl z (() is a submersion, z G N. 

The flow generated by the lifting field maps leaves of the foliation in the 
leaves, in other words it is leaves-wise. On the contrary, the flow generated by 
the regular field "smears" the fibers over O z /E loc ; basic examples are second 
order differential equations on a manifold M treated as the vector fields on 
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the tangent bundle TM = N. 

Let us write things in coordinates: We fix local coordinates acting in 
the domain O C N, which turn the foliation into the Cartesian product 
of vector spaces: O = {(x, y) : x, y G W 1 }, it : (x,y) i— > y. Then vec- 

tor field ( takes the form ( = \ a% ~i^~ ~'"^^")' wnere are smooth 

i=l ^ l ^ 

functions on W 1 x IR n . The coordinate representation of the map IT 2 is: 
n( x ,y) : ^ l— (fr 1 2/) ) • • • 5 b n (x, y)) . Field ( is regular if and only if II^) 
are submersions; in other words, if and only if ^^-^ is a nondegenerate 

matrix. Field £ is lifting if and only if = 0, i, j = 1, . . . , n. 

Now turn back to the coordinate free setting. The fibers E Z) z & N are 
integral manifolds of the involutive distribution £ = {T Z E Z : z G iV}. Given 
a vector field ( on iV, the (local) flow generated by £, and zGiVwe define 
the family of subspaces 

J*(t) = (e~ t<: )J\ z C T 2 7V. 

In other words, J z (t) = (e~*^ T etC(z) E et<{z) , J z {0) = T Z E Z . 

J x (t) is an n-dimensional subspace of T Z N, i.e. an element of the Grass- 
mannian G n (T z N). We thus have (the germ of) a curve t h- > J 2 (t) in G n (T z N) 
which is called a Jacobi curve. 

Definition. We say that field ( is k-ample for an interger k ii^z E N and 
for any curve t h- > J 2 (t) in G n (T z N) with the same fc-jet as J 2 (t) we have 
J 2 (0) fl J 2 (t) = for all t close enough but not equal to 0. The field is called 
ample if it is /c-ample for some k. 

It is easy to show that a field is 1-ample if and only if it is regular. 

9 The cross-ratio 

Let E be a 2n-dimensional vector space, vq,Vi G G n (E), v q fl v i = 0. Than 
S = f + f i. We denote by 7r 1 , 01 , 1 : E — > i>! the projector of E onto i>i parallel 
to fo- In other words, n VQVl is a linear operator on E such that tt^J^ = 0, 
7r ^oi'i L = Surely, there is a one-to-one correspondence between pairs of 
transversal n-dimensional subspaces of E and rank n projectors in gl(S). 
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Lemma II. 1 Let v G G n (E); we set n* — {v G G„(E) : v fl v — 0} ; an 
open dense subset of G n (E). Then {tc vvo : v G Vq} is an affine subspace of 
gl{Y). 

Indeed, any operator of the form air VVo + (1 — a)n WVo , where a G R, takes 
values in no and its restriction to no is the identity operator. Hence an VVo + 
(1 — a)ir wvo is the projector of E onto no along some subspace. 

The mapping n h- > n vvo thus serves as a local coordinate chart on G n (E). 
These charts indexed by no form a natural atlas on G n (E). 

Projectors 7^ satisfy the following basic relations: 3 

where Vi G G n (E), n« fin,- = for i ^ j. If n = 1, then G n (E) is just the pro- 
jective line MP 1 ; basic geometry of Cr n (E) is somehow similar to geometry of 
the projective line for arbitrary n as well. The group GL(E) acts transitively 
on G n (E). Let us consider its standard action on (k + l)-tuples of points in 
G n (S): 

de f 

A(v , ...,v k ) =i (Avq, Av k ), A G GL(E), v t G G n (E). 

It is an easy exercise to check that the only invariants of a triple (vo,vi,V2) 
of points of G n (E) for such an action are dimensions of the intersections: 
dim(nj C\Vj), < i < 2, and dim(n Hv 1 fln 2 ). Quadruples of points possess a 
more interesting invariant: a multidimensional version of the classical cross- 
ratio. 

Definition. Let Vi G G n (E), % = 0, 1, 2, 3, and no fl v\ = n 2 fl v% = 0. The 
cross-ratio of Uj is the operator [no, vi, n 2 , V3] G gl(ni) defined by the formula: 

[vq,Vi,V2,V 3 ] = T^v v^v 2 v 3 \ Vi - 

Remark. We do not lose information when restrict the product ir VQVl ir V2V3 to 
Vi] indeed, this product takes values in v\ and its kernel contains n . 

For n = 1, v± is a line and [n , Vi, n 2 , n 3 ] is a real number. For general n, the 
Jordan form of the operator provides numerical invariants of the quadruple 
v h i = 0, 1,2,3. 

3 Numbering of formulas is separate in each of two parts of the paper 
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We will mainly use an infinitesimal version of the cross-ratio that is an 
invariant [£oi£i] sK^i) °f a P a i r °f tangent vectors £j G T v .G n (E), i = 
0, 1, where v o fl t> i = 0. Let 7j(t) be curves in G„(S) such that 7i(0) = 

^7i(0| t=0 = &» * = °> L Then the cross-ratio: [7o(t), 7i(0), 7o0~), 7i(0)] 
is a well defined operator on t>i = 71 (0) for all t, r, close enough to 0. More- 
over, it follows from (1) that [70 (0) 7i(0)> 7o(0), 7i(0)] = 
[7o(0),7i(0),7oW,7i(0)] = [70(0), 7i(0), 7o(0), 7 i(*)] = id. We set 



q2 

[eo,6] = ^[7o(t),7i(0),7o(0), 7 i(r , „ 



t=T = 



(2) 



It is easy to check that the right-hand side of (2) depends only on £o>£i and 
that (£o)£i) l— *■ [£o,£i] is a bilinear mapping from T„ G n (£) x T 1 , 1 G n (S) onto 
^Z(ui). 



Lemma II. 2 Let vq,vi G G n (E), t> fl t>i = 0, & G T^G n (S), and & 



7i(0| 



t=o' 



0,1. T/^en [CcCi] 



8 2 

dtdr 71 (tho(r) 



vi 



invariant subspaces of the operator ■^^^■y 1 (t)-y (r) 



t=T = 
t=T=0 



and v±, Vq are 



Proof. According to the definition, [f ,fi] = aft: (^70 W71 (0)^70 (0)71 (r)) | t 



The differentiation of the identities ^(^(o^o^biO-) 
7r 7oW7i(^) 7r 7o(o)7i(r) = ^70(0)71 (r) S iv es the equalities: 



t=T=0 



7r 7oW7l(0), 



d 2 



dtdr 



,7T, 



70(<)7l(0) 7r 70(0)7i(r), 



<9 2 



— 7T. 



t=T=0 



t) fl 



<9t<9r 



7T 



7o(*)7l W 



t=r=0 



9 2 



dtdr 



IT. 



7o(*)7i M 



7T, 



t=r=0 



It remains to mention that ^7 7r 7i(t)7 (r) 



a 2 _ n 

"dtdr 7r 7o(-r)7l(*)- LJ 



10 Coordinate setting 

Given v { G G n (E), i — 0,1, 2, 3, we coordinatize £ = lR n x W n = {(x, y) : x G 
M n ,y G M n } in such a way that v-i fl {(0,y) : y G M n } = 0. Then there exist 
n x n-matrices Si such that 

f i = {(x, Six) : x G 1R™}, i = 0,l,2,3. (3) 
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The relation Vi fl Vj — is equivalent to det(5 , i — Sj) ^ 0. If So = 0, then the 






S^ 1 

projector ir VoVl is represented by the 2n x 2n-matrix ( n \ ) . In general, 



we have 



c— 1 C G~ 1 

°oi °o ~>->m 
S\S Q1 S$ —S\S 01 



where Soi = So — S\. Relation (3) provides coordinates {x} on the spaces V{. 
In these coordinates, the operator [vo, i>i, i>2, Vs] on v\ is represented by the 
matrix: 

[v ,v 1 ,v 2 ,v 3 ] = Sfo 1 ^^ 1 ^!, 

where S^- — Si — Sj. 

We now compute the coordinate representation of the infinitesimal cross- 
ratio. Let 7„(t) = {(x, Stx) : x G R n }, 71 (t) = {(x, Si +t x) :iGl"} so that 
& = ^^*(*)L=o * s re P resen ted by the matrix = ;|<St| t=i , 2 = 0,1. Then 
[£oi£i] is represented by the matrix 



d 



■2 



dtdr 



Q—l Q Q— 1 C 



(9 • ■ 

q— 1 c c — 1 c c — 1 c 

— -^TiJit *->i — »Joi D o»Joi °i- 



^ dt 



t=0 



So 

[So,ti]=S£S S£S 1 . (4) 

There is a canonical isomorphism T^ G n (S) = Hom(t> , £/t>o); it is defined 
as follows. Let £ G T„ G n (E), £ = J^7(t)| t= o, and z G t>o- Take a smooth 
curve G 7(t) such that z(0) = £ . Then the residue class (i(0) + i>o) G 
E/v depends on £ and z rather than on a particular choice of j(t) and 
Indeed, let ^'(t) be another curve in G n (E) whose velocity at t — 
equals £. Take some smooth with respect to t bases of ^(t) and ^'(t): 
7(t) = span{ei(t), . . . , e n (t)}, i(t) = span{e[(t), . . . , e' n (t)}, where e^O) = 
e^(0), i = l,...,n; then (ej(0) — e^(0)) G t>o, i = l,...,n. Let = 

n n 

£ ai(t)ei(t), /(t) = £ a4(tK(t), where a^O) = aj(0). We have: 
i=i i=i 

n 

i(0) - i'(0) = ]T ((^(0) - dj(0))ei(0) + «;(0)(e,(0) - ej(0))) G i*, 
i=i 

i.e. i(0) +v = i'(0) +u - 

We associate to £ the mapping £ : t> ~~ * ^/ v o defined by the formula 
t;z = i(0) + v . The fact that £ — > £ is an isomorphism of the linear 
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spaces T V0 G n (E) and Hom(t> , S/t> ) can be easily checked in coordinates. 
The matrices Si above are actually coordinate presentations of i — 0, 1. 

The standard action of the group GL(E) on G n (E) induces the action 
of GL(E) on the tangent bundle TG n (E). It is easy to see that the only 
invariant of a tangent vector £ for this action is rank£ (tangent vectors are 
just "double points" or "pairs of infinitesimaly close points" and number 
(n — rank£) is the infinitesimal version of the dimension of the intersection 
for a pair of points in the Grassmannian) . Formula (4) implies: 

rank[£ ,£i] < min{rank£ , rank£i}. 

11 Curves in the Grassmannian 

Let 1 1— > v(t) be a germ at i of a smooth curve in the Grassmannian G n (E). 

Definition. We say that the germ v(-) is ample if v(t) fl v(t) = Vi ^ t 
and the operator-valued function t i— > 7r„( t )„m has a pole at t. We say that 
the germ v(-) is regular if the function t i— > n v (t)v{t) has a simple pole at t. 
A smooth curve in G n (E) is called ample (regular) if all its germs are ample 
(regular). 

Assume that E = {(x, y) : x, y e W 1 } is coordinatized in such a way that 
v(t) = {(x,0) : x G M n }. Then = {(a;, Star) : a: G W 1 }, where = 

and n v (t) v (i) = ( q q* ) • ^ ne g erm v (') i s ample if and only if the scalar 

function t i— > det S't has a finite order root at t. The germ t>(-) is regular if 
and only if the matrix Si is not degenerate. More generally, the curve r i— > 
{(x, S't-x) : a; G M n } is ample if and only if Vt the function r i— > det(S' r — S 1 *) 
has a finite order root at t. This curve is regular if and only if det St ^ 0, Vt. 
The intrinsic version of this coordinate characterization of regularity reads: 
the curve v(-) is regular if and only if the map v(t) G Hom(v(t), Yl/v(t)) has 
rank n, Vt. 

Coming back to the vector fields and their Jacobi curves (see Sec. 8) one 
can easily check that a vector field is ample (regular) if and only if its Jacobi 
curves are ample (regular). 

Let v (•) be an ample curve in G n (E). We consider the Laurent expansions 
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at t of the operator- valued function r i— > 7T„( T )„(t) , 



rn 



7T, 



■(rMt)= 5^-^ + 0(r-t) 



m+l 



i=-k t 



Projectors of S on the subspace t> (t) form an affine subspace of gl(£) (cf. 
Lemma III). This fact implies that 7r° is a projector of E on in other 
words, 7r° = ^0^^^) for some v°(t) G We thus obtain another curve 

* i-> u°(t) in G n (£), where £ = ©u°(t), Vt. The curve t ^ v°(t) is called 
the derivative curve of the ample curve v(-). 

The affine space {n wv (t) '■ w G v(t)^} is a translation of the linear space 
y\(v(t)) = {n : E — > f (£) I n|„( t ) = 0} C gl(S)} containing only nilpotent 
operators. It is easy to see that n\ G 9T(-u(t)) for % 7^ 0. 

The derivative curve is not necessary ample. Moreover, it may be nons- 
mooth and even discontinuous. 

Lemma II. 3 Ifv(-) is regular then v°(-) is smooth. 

Proof. We'll find the coordinate representation of v°(-). Letv(t) = {(x,S t x) : 
x G M n }. Regularity of v(-) is equivalent to the nondegeneracy of S t . We 



where S Tt = S T - S t . Then S^ 1 = (r - t)' 1 ^ 1 - ^S^StSf 1 + 0(r - i) as 



have: 




r 



t and 



^v(r)v(t) = ( T ~t) 




St — 




+ 




is 



(5) 
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12 The curvature 



Definition. Let v be an ample curve and v° be the derivative curve of 
v. Assume that v° is differentiable at t and set R v (t) = [v°(t),v(t)]. The 
operator R v (t) G gl{v{t)) is called the curvature of the curve v at t. 

If v is a regular curve, then v° is smooth, the curvature is well-defined 
and has a simple coordinate presentation. To find this presentation, we'll 
use formula (4) applied to £o = v°(t), £1 = v(t). As before, we assume that 
v(t) = {(x,S t x) : x G M n }; in particular, v(t) is transversal to the subspace 
{(0, y) : y G M n }. In order to apply (4) we need an extra assumption on the 
coordinatization of S: the subspace v°(t) has to be transversal to {(0, y) : 
y G W 1 } for given t. The last property is equivalent to the nondegeneracy 
of the matrix A t (see (6)). It is important to note that the final expression 
for R v (t) as a differential operator of S must be valid without this extra 
assumption since the definition of R v (t) is intrinsic! Now we compute: v°(t) = 
{(x, (A^ 1 + S t )x) : x G M n }, R v (t) = [v°(t),v(t)] = MMt' + S t)AS t = 
{A t S t f - A t S t = KSf'St) 2 - A t S t . We also have AS = -Ij-^SS-^S = 
(S- 1 ) 2 - IS" 1 S. Finally, 

Rv(t) = ifiT 1 St —(Sr'St) 2 = j t ((2^)- 1 ^) - ((25 t )- 1 ^) 2 , (6) 

the matrix version of the Schwartzian derivative. 

Curvature operator is a fundamental invariant of the curve in the Grass- 
mannian. One more intrinsic construction of this operator, without using 
the derivative curve, is provided by the following 

Proposition II. 1 Let v be a regular curve in G n (E). Then 
[v(r),v(t)} = (T-ty 2 id+\R v (t) + 0(T-t) 

as t — > t. 

Proof. It is enough to check the identity in some coordinates. Given t we 
may assume that 

v(t) = {(x,0) : x G R n }, v°(t) = {(0,y) : y G R n }. 

Let v(t) = {(x,S T x : x G lR n }, then S t = S t = (see (5)). Moreover, we 
may assume that the bases of the subspaces v(t) and v°(t) are coordinated 
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in such a way that S t = I. Then R v (t) — \ St (see (6)). On the other hand, 
formula (4) for the infinitesimal cross-ratio implies: 

[v(r),v(t)] = s; 1 s T s; 1 = -^(s; 1 ) = 



^ (V - t)- l I - St^j + 0(t -t) = (r- t)~ 2 I + l - St +0(r - t). 



dr 

( (r-t)- 1 

dr 



□ 

Curvature operator is an invariant of the curves in G n (E) with fixed 
parametrizations. Asymptotic presentation obtained in Proposition II. 1 im- 
plies a nice chain rule for the curvature of the reparametrized curves. 

Let tp : K. — > IR be a regular change of variables, i.e. 0^0, Vl The 
standard imbedding IR C IRP 1 = Gi(IR 2 ) makes ip a regular curve in Gi(IR 2 ). 
As we know (see (6)), the curvature of this curve is the Schwartzian of <p: 

We set v v {t) = v((p(t)) for any curve v in G n (E). 

Proposition II. 2 Let v be a regular curve in G ra (S) and ip : IR — > IR be a 

regular change of variables. Then 

R Vv {t) = ^ 2 (t)R v ( V (t)) + R^t). (7) 

Proof. We have 

[^(r),^(t)] = ( T -t)- 2 id + ^R Vv (t) + 0(r-t). 
On the other hand, 

[i v (T),V v (t)] = [&(T)v(<p(T)),<p(t)v(<p(t))] = ip(T)ip(t)[v(ip(T)),v(ip(t))\ = 

<p(r)ip(t) Up(r) - cp(t)y 2 id + ^R v (<p(t)) + 0(r - t) \ = 
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((^)-^)) J 3 
We treat <p as a curve in MP 1 = G^M 2 ). Then [(p(r),(p(t)] = (jff^L , 
see (4). The one-dimensional version of Proposition II. 1 reads: 

[#t), = {t~ r)- 2 + ij^(f) + 0(r - t). 

Finally, 

Mr), v v {t)\ ={t- r)- 2 + i (i^(t) + 2 (t)i?^(t))) + 0(r - t). □ 
The following identity is an immediate corollary of Proposition II. 2: 

R V(p - l(trit^)idj (t) = <p 2 (t) \Rv - ^(tvR v )id\ (cp(t)). (8) 

Definition. An ample curve v is called flat if Ry(t) = 0. 

It follows from Proposition II. 1 that any small enough piece of a regular 
curve can be made flat by a reparametrization if and only if the curvature 
of the curve is a scalar operator, i.e. R v (t) = ^(tiR v (t))id. In the case 
of a nonscalar curvature, one can use equality (8) to define a distinguished 
parametrization of the curve and then derive invariants which do not depend 
on the parametrization. 

Remark. In this paper we are mainly focused on the regular curves. See 
paper jB] for the version of the chain rule which is valid for any ample curve 
and for basic invariants of unparametrized ample curves. 



13 Structural equations 

Assume that v and w are two smooth curves in G n (S) such that 
v(t) n w{t) = 0, Vt. 

Lemma II. 4 For any t and any e G v(t) there exists a unique f e G w(t) 
with the following property: 3 a smooth curve e T G v(r), e t = e, such that 
J^e T | = f e . Moreover, the mapping ^ w : e i— > f t is linear and for any 
eo G v(0) there exists a unique smooth curve e(t) G v(t) such that e(0) = eo 
and 

e{t) = $re(t), W. (9) 
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Proof. First we take any curve e T G v{r) such that e t = e. Then e T = a T + b T 
where a T G v(t), b T G w(t). We take x T G v{r) such that x t = a t and set 
e T = e T + (t — r)x T . Then e< = fej and we put f e = b t . 

Let us prove that b t depends only on e and not on the choice of e T . 
Computing the difference of two admissible e T we reduce the lemma to the 
following statement: if z(t) G v(t), Vt and z(t) = 0, then z(t) G v(t). 

To prove the last statement we take smooth vector-functions e\ G i>(t), i = 

n 

1, . . . , n such that v(t) = spaniel, . . . , e"}. Then z(t) = Yl a i( T ) e Ti a i(t) — 

i=i 

n 

0. Hence z(t) = ®i(t) e t e v t- 
i=i 

Linearity of the map follows from the uniqueness of f e . Indeed, if 
f e i = jpK\ T=t ] then ^( a i e r+"2e?)| r=t = aif e i+a 2 f e ^, hence aif e i+a 2 f e ^ = 
/a ie i+a 2 e 2 , Ve* G «j G R, i = 1,2. 

Now consider the smooth submanifold V = {(t, e) : i G R, e G w(t)} of 
KxS. We have (1, $™e) G T (t)e) y since (1, $^e) is the velocity of a curve 
r i— > (t, e T ) in V. So (£, e) i— > (l,$" TO e), (t, e) G V is a smooth vector field 
on V. The curve e(t) G f (t) satisfies (9) if and only if (t, e(t)) is a trajectory 
of this vector field. Now the standard existence and uniqueness theorem for 
ordinary differential equations provides the existence of a unique solution to 
the Cauchy problem for small enough t while the linearity of the equation 
guarantees that the solution is defined for all t. □ 

It follows from the proof of the lemma that $J™e = ^v{t)w{t) e r\ T _ t for 
any e T G v(t) such that v t = e. Let v(t) = {(x,S vt x) : x G M n }, w(t) = 
{(x,S wt x) : x G M. n }; the matrix presentation of <& v t w in coordinates x is 
(S wt — Svt^Svt. Linear mappings <& v t w and provide a factorization of 
the infinitesimal cross-ratio [w(t),v(t)]. Indeed, equality (4) implies: 

[w(t),v(t)] = -*r*r- 

Equality (9) implies one more useful presentation of the infinitesimal cross- 
ratio: if e(t) satisfies (9), then 

[w(t),v(t)W) = -®7 v ®t w e(t) = -$r = -M*M*) g (*)- l 11 ) 

Now let w be the derivative curve of v, w(t) = v°(t). It happens that 
e(t) G v(t) in this case and (11) is reduced to the structural equation: 

e(t) = -[v°(t),v(t)]e(t) = -R v (t)e(t), 
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where R v (t) is the curvature operator. More precisely, we have the following 

Proposition II. 3 Assume that v is a regular curve in G ra (E) 7 v° is its 
derivative curve, and e(-) is a smooth curve in E such that e{t) G v (t), Wt. 
Then e{t) G v°(t) if and only if e(t) G v (t). 

Proof. Given t, we take coordinates in such a way that v (t) = {(x, 0) : x G 
R n }, u°(t) = {(0,y) : y G M n }. Then u(r) = {(x, S T :r) : x G M n } for r close 
enough to t, where S t — S t — (see (5)). 

Let e(r) = {(x(r), SVa^r))}. The inclusion e(t) G t>°(t) is equivalent to 
the equality x(t) = 0. Further, 

e(t) = {x(t), S t x(t) + 2S t x(t) + S t x(t)} = {x(t), 2Sx} G v(t). 

Regularity of v implies the nondegeneracy of S(t). Hence e(t) G v(t) if and 
only if x(t) = 0. □ 

Now equality (11) implies 

Corollary II.l If e(t) = <$> v t v °e(t), then e(t) + R v {t)e{t) = 0. 

Let us consider invertible linear mappings V t : v(0) — > v(t) defined by 
the relations V t e(0) = e(t), e(r) = $™°e(r), < r < t. It follows from 
the structural equation that the curve v is uniquely reconstructed from v(0) 
and the curve * i-> V^Rvit) in gl(u(0)). Moreover, let v G G„(E) and ^ G 
Tu (jn(E), where the map £ G Hom(t> , E/f ) has rank n; then for any smooth 
curve t I— > A(t) in gl(i>o) there exists a unique regular curve f such that 
v(0) = £ and Vf 1 R v (t)V t = A(t). Indeed, let ej(0), i = 1, . . . , n, be a basis 

n 

of f and A(t)ei(0) = a ij(t) e j(®)- Then v(t) = span{ei(t), . . . , e n (t)} : 

3=1 

where 

n 

e,(r) + ^a^(r)e,(r) = 0, < r < t, (12) 

are uniquely defined by fixing the w(0). 

The obtained classification of regular curves in terms of the curvature is 
particularly simple in the case of a scalar curvature operators R v (t) = p(t)id. 
Indeed, we have A(t) = Vf 1 R v (t)V t = p(t)id and system (12) is reduced to 
n copies of the Hill equation e(r) + p(r)e(r) = 0. 

Recall that all £ G TG n (E) such that rank£ = n are equivalent under the 
action of GL(E) on TG n (E) induced by the standard action on the Grass- 
mannian G n (E). We thus obtain 
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Corollary II. 2 For any smooth scalar function p(t) there exists a unique, 
up to the action of GL(E), regular curve v in G n (E) such that R v (t) = p(t)id. 

Another important special class is that of symmetric curves. 

Definition. A regular curve v is called symmetric if VtR v (t) = R v (t)Vt, Vi. 

In other words, v is symmetric if and only the curve A(t) = V^ 1 R v (t)V t 
in gl(t> (0)) is constant and coincides with R v (0). The structural equation 
implies 

Corollary II. 3 For any n x n-matrix A , there exists a unique, up to the 
action of GL(T,), symmetric curve v such that R v (t) is similar to A . 

The derivative curve v° of a regular curve v is not necessary regular. 
The formula R v (t) = ^° v ^ v ° implies that v° is regular if and only if the 
curvature operator R v (t) is nondegenerate for any t. Then we may compute 
the second derivative curve v°° = (v°)°. 

Proposition II. 4 A regular curve v with nondegenerate curvature operators 
is symmetric if and only if v°° = v. 

Proof. Let us consider system (12). We are going to apply Proposition II. 3 
to the curve v° (instead of v) and the vectors e«(t) G v°(t). According to 
Proposition II. 3, v°° = v if and only if J^-e^(i) G v°(t). Differentiating (12) 
we obtain that v°° — v if and only if the functions etij(t) are constant. The 
last property is none other than a characterization of symmetric curves. □ 

14 Canonical connection 

Now we apply the developed theory of curves in the Grassmannian to the 
Jacobi curves J z (t) (see Sec. 8). 

Proposition II. 5 All Jacobi curves J z (-), z G N, associated to the given 
vector field ( are regular (ample) if and only if the field ( is regular (ample). 

Proof. The definition of the regular (ample) field is actually the specifica- 
tion of the definition of the regular (ample) germ of the curve in the Grass- 
mannian: general definition is applied to the germs at t — of the curves 
1 1— > J z {t). What remains is to demonstrate that other germs of these curves 
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are regular (ample) as soon as the germs at are. The latter fact follows 
from the identity 

Ut + r) = e;* c J et c( 2) (T) (13) 

(which, in turn, is an immediate corollary of the identity e^ <+r ' ) '' = e~ tc - o 
e~ T ^). Indeed, (13) implies that the germ of J z {-) at t is the image of the 
germ of J e *c( T )(-) at under the fixed linear transformation e~ tc - : T e tc^N — > 
T Z N. The properties of the germs to be regular or ample survive linear 
transformations since they are intrinsic properties. □ 

Let ( be an ample field. Then the derivative curves J°(t) are well-defined. 
Moreover, identity (13) and the fact that the construction of the derivative 
curve is intrinsic imply: 

J°,(t) = e;«r# w (0). (14) 

The value at of the derivative curve provides the splitting T Z M = J z (0) © 
J°(0), where the first summand is the tangent space to the fiber, J z (0) = 
T Z E Z . 

Now assume that J°(t) smoothly depends on z; this assumption is au- 
tomatically fulfilled in the case of a regular (, where we have the explicit 
coordinate presentation for J°{t). Then the subspaces J°(0) C T Z N, z G N, 
form a smooth vector distribution, which is the direct complement to the ver- 
tical distribution £ = {T Z E Z : z G iV}. Direct complements to the vertical 
distribution are called Ehresmann connections (or just nonlinear connections, 
even if linear connections are their special cases). The Ehresmann connection 
£^ = {J°(0) : z G N} is called the canonical connection associated with £ 
and the correspondent splitting TN — £ ©£^ is called the canonical splitting. 
Our nearest goal is to give a simple intrinsic characterization of £q which 
does not require the integration of the equation z = ((z) and is suitable for 
calculations not only in local coordinates but also in moving frames. 

Let JF = {F z C T Z N : z G N} be an Ehresmann connection. Given a 
vector field f on E we denote £ ve r(z) = ^f z j z (o)^, thor(z) = ^j z (o)f z ^ tlie 
"vertical" and the "horizontal" parts of £(z). Then £ = £„ er + ^ hor , where 
£„ er is a section of the distribution £ and ^ l0 r is a section of the distribution 
T . In general, sections of £ are called vertical fields and sections of T are 
called horizontal fields. 

Proposition II. 6 Assume that ( is a regular field. Then T = £^ if and only 
if the equality 

[C» [C» U ]]hor = 2[C, [C v\ver\hor (15) 
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holds for any vertical vector field v. Here [,] is Lie bracket of vector fields. 

Proof. The deduction of identity (15) is based on the following classical 
expression: 

|e;^ = e^[C,e (16) 

for any vector field £. 

Given z G N, we take coordinates in T Z N in such a way that T Z N = 
{(x,y) :x,ye R n }, where J z (0) = {(x, 0) : x G W 1 }, J°(0) = {(0,y) : y G 
R n }. Let J z (t) = {(x, S t x) : a: G M n }, then S = S = and det 5 ^ due 
to the regularity of the Jacobi curve J z . 

Let z/ be a vertical vector field, v(z) = (x ,0) and (e^iA (z) = (x t ,y t ). 
Then (x t , 0) = (e"*^)^ (z), (0,y t ) = (e~* c ^)^ or (2). Moreover, y t = 5 t a; t 
since (e~*^i/) (2) G J 2 (i). Differentiating the identity y t = S^Xt we obtain: 
y 4 = SfX t + S t x t . In particular, y — S x Q . It follows from (16) that (x , 0) = 
[C, i/]„ er , (0,y ) = [C, ^W- Hence (0, £0X0) = [C, v]hor(z), where, I recall, v is 
any vertical field. Now we differentiate once more and evaluate the derivative 
at 0: 

y = S o x + 2S x + S x = 2S x . (17) 

The Lie bracket presentations of the left and right hand sides of (17) are: 
(0,y ) = [C,[C» u ]]hor, (0,S x ) = [C, [(,v]ver]hor- Hence (17) implies identity 
(15). 

Assume now that {(0, y) : y G M n } 7^ </°(0); then Sqx q 7^ for some x . 
Hence jjo 7^ 25*0X0 and equality (15) is violated. □ 

Inequality (15) can be equivalently written in the following form that is 
often more convenient for the computations: 

**[U{M](*)=2**[{,KMver](z), V^GiV. (18) 

Let Rj z (t) G g\(J z (t)) be the curvature of the Jacobi curve J z (t). Identity 
(13) and the fact that construction of the Jacobi curve is intrinsic imply that 

RjM = e;«Rj etc jo)e?\ Mty 

Recall that J z (0) = T Z E Z ; the operator Rj z (0) G g\(T z E z ) is called the curva- 

def 

ture operator of the field ( at z. We introduce the notation: R((z) = Rj z (0); 
then R^ = {R^(z)} z&E is an automorphism of the "vertical" vector bundle 
{T z E z } zeM . 
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Proposition II. 7 Assume thatC, is an ample vector field and J°(0) is smooth 
with respect to z. Let TN — £ © £^ be the canonical splitting. Then 

= -[(, [C,v]hor]ver (19) 

for any vertical field v . 

Proof. Recall that Rj z (0) = [ J°(0), J z (0)}, where [•, •] is the infinitesimal 
cross-ratio (not the Lie bracket!). The presentation (10) of the infinitesimal 
cross-ratio implies: 

r^z)=r,jM = -<° Jz < zJ % 

where $Q W e = 7r„( )„,(o)eo for any smooth curve e T G v(t) such that e = e. 
Equalities (14) and (16) imply: <$> J Q zJ °v(z) = [(,v] ver (z), G M. Similarly 

jo j 

$ Z z fi(z) = [(, fj]hor{ z ) f° r an Y horizontal field ji and any z G M. Finally, 

R ( {z)is(z) = -<° Jz < zJ ° = -[C, [C, v]hor]ver{z)- □ 

15 Coordinate presentation 

We fix local coordinates acting in the domain O C N, which turn the foliation 
into the Cartesian product of vector spaces: O = {(x,y) : x,y G W 1 }, n : 

(x, y) i— > y. Then vector field ( takes the form ( = ( a *^~ + ) ' wriere 

a 1 , b % are smooth functions on IR n x R n . Below we use abridged notations: 
— d x ., ^r- = (p x . etc. We also use the standard summation agreement for 

axi Xl ' oxi r x t o 

repeating indices. 

Recall the coordinate characterization of the regularity property for the 
vector field (. Intrinsic definition of regular vector fields is done in Section 8; 
it is based on the mapping U z whose coordinate presentation is: : x i— > 

(6 1 (a:, y), . . . , b n (x, y)) 1 . Field ( is regular if and only if U y are submersions; 

in other words, if and only if ( b\ ) is a non degenerate matrix. 

V Xj Ji,j=i 

Vector fields d Xi , i = 1, . . . ,n, provide a basis of the space of vertical 
fields. As soon as coordinates are fixed, any Ehresmann connection finds a 
unique basis of the form: 

( d Vi)hor = d Vi+ C i d *V 
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where c-, i,j = \,...,n, are smooth functions on R n x W 1 . To characterize 
a connection in coordinates thus means to find functions cj. In the case of 
the canonical connection of a regular vector field, the functions c? { can be 
easily recovered from identity (18) applied to v — d Xi , i — 1, . . . , n. We'll do 
it explicitly for two important classes of vector fields: second order ordinary 
differential equations and Hamiltonian systems. 
A second order ordinary differential equation 

y = x, x = f(x,y) (20) 

there corresponds to the vector field ( = f % d Xi +Xid yi1 where / = (/i, . . . , / n ) T . 
Let v = d Xt ; then 

[(,is} = -d yi - ft.d Xj , [C, V]ver = (4 - f 3 x% )d Xj , 
MC, [C, V\\ = f 3 x .3 yj , 7T*[C, [C, V\ver] = (f Xi ~ 4) d y 3 - 

Hence, in virtue of equality (18) we obtain that d L = for the canonical 
connection associated with the second order differential equation (20). 

Now consider a Hamiltonian vector field ( = —h yi d Xi + h Xi d yi , where h 
is a smooth function on M n x R n (a Hamiltonian). The field ( is regular if 
and only if the matrix h xx = (h XiXj )™ ' is non degenerate. We are going to 

characterize the canonical connection associated with (. Let C = (cf')™. =1 ; 
the straightforward computation similar to the computation made for the 
second order ordinary differential equation gives the following presentation 
for the matrix C: 

2 ( /? y> rp f rp J - ■ j }' /y ^ ly /y . rp .ft, \\li> l , f^/iy • rp . rp . rp .111, ^t/rp , rp . 'f . rp , , T ■ 

or, in the matrix form: 

^h xx Ch xx {/i, h xx \ h xy h xx h xx h yx , 
where {h, h xx ] is the Poisson bracket: {h, h xx }ij = {h, h XiXj } = h Xk h XiXjyk — 

^Vk ^^i^j^k ' 

Note that matrix C is symmetric in the Hamiltonian case (indeed, 
hxxh y x = {h xy h xx ) T ). This is not occasional and is actually guaranteed by the 
fact that Hamiltonian flows preserve symplectic form dxiAdi/i. See Section 17 
for the symplectic version of the developed theory. 
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As soon as we found the canonical connection, formula (19) gives us 
the presentation of the curvature operator although the explicit coordinate 
expression can be bulky. Let us specify the vector field more. In the case of 
the Hamiltonian of a natural mechanical system, h(x,y) = ^\x\ 2 + U(y), the 
canonical connection is trivial: = 0; the matrix of the curvature operator 

is just Uyy. 

Hamiltonian vector field associated to the Hamiltonian 
h(x,y) = g^(y)xiXj with a non degenerate symmetric matrix (<7 l3 ')™- =1 gen- 
erates a (pseudo-)Riemannian geodesic flow. Canonical connection in this 
case is classical Levi Civita connection and the curvature operator is Ricci 
operator of (pseudo-)Riemannian geometry (see |U Sec. 5] for details). Fi- 
nally, Hamiltonian h(x,y) = g^(y)xiXj + U(y) has the same connection as 
Hamiltonion h(x, y) = g l ^{y)x,iXj while its curvature operator is sum of Ricci 
operator and second covariant derivative of U. 

16 Affine foliations 

Let [£] be the sheaf of germs of sections of the distribution £ = {T Z E Z : 
z G iV} equipped with the Lie bracket operation. Then [£} z is just the Lie 
algebra of germs at z G M of vertical vector fields. Affine structure on the 
foliation E is a sub-sheaf [£} a C [£} such that [£} a z is an Abelian sub-algebra 
of [£] and {<j(z) : <r G [£]"} = T Z E Z , \/z G N. A foliation with a fixed affine 
structure is called the affine foliation. 

The notion of the affine foliation generalizes one of the vector bundle. In 
the case of the vector bundle, the sheaf [£] a is formed by the germs of vertical 
vector fields whose restrictions to the fibers are constant (i.e. translation 
invariant) vector fields on the fibers. In the next section we will describe an 
important class of affine foliations which is not reduced to the vector bundles. 

Lemma II. 5 Let £ be an affine foliation, <j G [£]" and q(z) = 0. Then 

?k = o. 

Proof. Let q%, . . . , <j n G be such that <;i(z), . . . , <; n (z) form a basis of T Z E Z . 
Then q = biSx + ■ • • + b n (; n , where bi are germs of smooth functions vanishing 
at z. Commutativity of [£] a z implies: = [<s,?] = (q6i)?i + ■ • ■ + {^ib n )q n . 
Hence functions h\E z are constants, i.e. h\E z — 0, i — 1, . . . , n. □ 

Lemma II. 5 implies that q G [£\ a z is uniquely reconstructed from q(z). 
This property permits to define the vertical derivative of any vertical vector 
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field v on M. Namely, Vt> G T Z E Z we set 

D v v = [q, u](z), where q G [£]° , q(z) = v. 

Suppose ( is a regular vector field on the manifold N endowed with the 
affine n-foliation. The canonical Ehresmann connection 8$ together with the 
vertical derivative allow to define a canonical linear connection V on the 
vector bundle 8. Sections of the vector bundle 8 are just vertical vector 
fields. We set 

= [£, u) ver + D u (£ ver ), 

where £ is any vector field on iV and v is a vertical vector field. It is easy 
to see that V satisfies axioms of a linear connection. The only non evident 
one is: V^z/ = bV^u for any smooth function b. Let z G iV , q G [£]", and 
q(z) = v{z). We have 

= [b£, u) ver + b£ ver ) = 

Hence 

(V K U)(Z) = b(z) {[^Mver{z) + [?,&*,-](*)) = (&V^)(z). 

Linear connection V gives us the way to express Pontryagin characteristic 
classes of the vector bundle 8 via the regular vector field (. Indeed, any linear 
connection provides an expression for Pontryagin classes. We are going to 
briefly recall the correspondent classical construction (see [T3] for details). 
Let i? v = [Vf , Vfj] — V[£ lJ? ] be the curvature of linear connection V. 
Then i? v (£,?7)z/ is C°°(M)-linear with respect to each of three arguments 
€,T},u. In particular, R v (-, -)u{z) G f\(T*N) © T Z E Z , z G N. In other 
words, i? v (-,-) G Hom(£,/\ 2 (T*A0 ®£). 

Consider the commutative exterior algebra 

N = C°°(N) © /\ (T*N) © • • • © /\ (T*N) 

of the even order differential forms on N. Then i? v can be treated as an endo- 
morphism of the module f\ ev N © 8 over algebra /\ ev N, i. e. 
i? v G Endy\™ N (/\ ev M © 8). Now consider characteristic polynomial 

_ n 

det(tl + ^FF) =t n + J2 <Pit n ~\ where the coefficient <p j is an order 2i dif- 

i=l 

ferential form on N. All forms 0« are closed; the forms 02fe-i are exact and 
the forms 02fc represent the Pontryagin characteristic classes, k = 1, . . . , [|]. 
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17 Symplectic setting 



Assume that N is a symplectic manifold endowed with a symplectic form 

a. Recall that a symplectic form is just a closed non degenerate differential 

2- form. Suppose E is a Lagrange foliation on the symplectic manifold (N, a); 

this means that u\e z = 0, Vz 6 N. Basic examples are cotangent bundles 

endowed with the standard symplectic structure: iV = T*M, E z = T*, z sM, 

where 7r : T*M — > M is the canonical projection. In this case a = dr, where 

t = {t z : z G T*M} is the Liouville 1-form on T*M defined by the formula: 

t z = z o 7r*. Completely integrable Hamiltonian systems provide another 

important class of Lagrange foliations. We'll briefly recall the correspondent 

terminology. Details can be found in any introduction to symplectic geometry 

(for instance, in [TUj). 

Smooth functions on the symplectic manifold are called Hamiltonians. 

To any Hamiltonian there corresponds a Hamiltonian vector field h on M 

defined by the equation: dh = a(-,h). The Poisson bracket {hi,h 2 } of the 

Hamiltonians hi and h 2 is the Hamiltonian defined by the formula: {hi, h 2 } = 

a(hi,h 2 ) = h\h 2 . Poisson bracket is obviously anti-symmetric and satisfies 

the Jacobi identity: {hi, {h 2 , h 3 }} + {h 3 , {hi, h 2 }} + {h 2 , {h s , hi}} = 0. This 

identity is another way to say that the form a is closed. Jacobi identity 

> -> -, 

implies one more useful formula: {hi, h 2 } = [hi, h 2 }. 

We say that Hamiltonians hi, . . . , h n are in involution if {hi, hj} = 0; then 
hj is constant along trajectories of the Hamiltonian equation i = hi(z), i,j = 
1, . . . , n. We say that hi, . . . ,h n are independent if d z hi A ■ • • A d z h n ^ 0, z G 
N. n independent Hamiltonians in involution form a completely integrable 
system. More precisely, any of Hamiltonian equations z = hi(z) is completely 
integrable with first integrals hi, ... , h n . 

Lemma II. 6 Let Hamiltonians hi, . . . ,h n form a completely integrable sys- 
tem. Then the n- foliation E z = {z' G M : hi(z') — hi(z), i = 1, . . . , n}, z G 
N, is Lagrangian. 

Proof. We have hihj = 0, i,j = l,...,n, hence hi(z) are tangent to E z . 
Vectors hi(z), . . . , h n (z) are linearly independent, hence 

span{hi(z), . . . , h n (z)} = T Z E Z . 

Moreover, a(hi, hj) = {hi, hj} = 0, hence <j\e z =0. □ 
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Any Lagrange foliation possesses a canonical affine structure. Let [£} be 
the sheaf of germs of the distribution £ = {T Z E Z : z G iV} as in Section 16; 
then [£] a is the intersection of [£] with the sheaf of germs of Hamiltonian 
vector fields. 

We have to check that Lie algebra [£] a z is Abelian and generates T Z E Z , Vz G 

N. First check the Abelian property. Let hi,h 2 G [£]"; we have [hi,h 2 \ = 
> -, -, -. 

{hi, h 2 }, {hi, h 2 } = cr{hi, h 2 ) = 0, since hi are tangent to E z and o\e z = 0. 
The second property follows from the Darboux-Weinstein theorem (see [TU] ) 
which states that all Lagrange foliations are locally equivalent. More pre- 
cisely, this theorem states that any z G M possesses a neighborhood O z and 
local coordinates which turn the restriction of the Lagrange foliation E to 
O z into the trivial bundle W 1 x W 1 = {(x,y) : x, y G E n } and, simulta- 

n 

neously, turn a\o z into the form dxi A dyi. In this special coordinates, 

i=i 

the fibers become coordinate subspaces 1R™ x {y}, y G E n , and the required 
property is obvious: vector fields ^- are Hamiltonian fields associated to the 
Hamiltonians —y%, i = 1, . . . ,n. 

Suppose C is a Hamiltonian field on the symplectic manifold endowed 
with the Lagrange foliation, ( = h. Let ? G [£ ]" , <; — s ; then qh = {s,h}. 
The field h is regular if and only if the quadratic form s i— > {s, {s, h}}(z) has 
rank n. Indeed, in the 'Darboux-Weinstein coordinates' this quadratic form 
has the matrix {^-}«. =1 . 

Recall that the tangent space T Z N to the symplectic manifold iV is a sym- 
plectic space endowed with the symplectic structure o z . An n-dimensional 
subspace v C T Z N is a Lagrangian subspace if a z \ v = 0. The set 

L(T z N) = {veG n (T z M):a z \ v = 0} 

of all Lagrange subspaces of T Z M is a Lagrange Grassmannian. 

Hamiltonian flow e'^ preserves the symplectic form, ^e*^ a = a. Hence 

^tti^j . T Z N — > T e± %^N transforms Lagrangian subspaces in the Lagrangian 

ones. It follows that the Jacobi curve J z (t) = ^e - '^ T^n^E^^ consists 

of Lagrangian subspaces, J z (t) G L(T Z N). 

We need few simple facts on Lagrangian Grassmannians (see Sec. 6 for 
the basic information and jHl Sec. 4] for a consistent description of their 
geometry). Let (E, a) be a 2n-dimensional symplectic space and v ,Vi G 
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be a pair of transversal Lagrangian subspaces, v fl V\ = 0. Bilinear 
form a induces a non degenerate pairing of the spaces v and v i by the 
rule (e, /) i— > <x(e, /), e E vo, f E v±. To any basis e±, . . . , e n of -l> we m &y 
associate a unique dual basis fi, . . . , f n of ^1 such that (x(ej, /}) = 5^-. The 
form a is totally normalized in the basis e±, . . . , e n , fi, . . . , f n of E, since 
a(ej,ej) = /j) = 0. It follows that symplectic group 

Sp(E) = {Ae GL(E) : <r(Ae, A/) = <r(e, /), e, / G E} 

acts transitively on the pairs of transversal Lagrangian subspaces. 

Next result is a 'symplectic specification' of Lemma II. 1 from Section 9. 

Lemma II. 7 Let v E L(H); then {ir VVQ : v E v§ fl £(S)} is an affine 
subspace of the affine space {n vvo : v E v§ } characterized by the relation: 

Proof. Assume that v\ G Vq fl L(T). Let e, / G E, e = eo + e±, f = fo + fi 
where e*, /j E Vi, i = 0, 1; then 

o-(e, /) = o-(e + ei, / + /i) = a(e , /i) + tf(ei, / ) = 

o-(e , /) + a{e, f ) = a(ir VlVo e, f ) + a(e, ir VlV J). 

Conversely, let v G Vq is not a Lagrangian subspace. Then there exist e, f E v 
such that <r(e, /) ^ 0, while a"(7r TOO e, /) = a(e, ir vvo f) = 0. □ 

Corollary II. 4 Letv(-) be an ample curve in G„(E) andv°(-) be the deriva- 
tive curve ofv(-). If v(t) E L(E), Vt, toen G L(E). 

Proof. The derivative curve t>° was defined in Section 11. Recall that 
Kv°(t)v{t) — ^ti where 7if is the free term of the Laurent expansion 

oo 

^v(r)v(t) ~ ^2 ( T " *) t7r * - 

i=—kt 

The free term 7r° belongs to the affine hull of 7r„( r )„(t), when r runs a neigh- 
borhood oft. Since 7Tu( T )„(t) belongs to the affine space {n VVo : v E v§ flL(E)}, 
then 7if belongs to this affine space as well. □ 

We call a Lagrange distribution any rank n vector distribution {A z C 
T Z N : z E N} on the symplectic manifold N such that A 2 G L(T Z N), z E N. 
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Corollary II. 5 Canonical Ehresmann connection £^ = {J°(0) : z G N} 

— * 

associated to an ample Hamiltonian field ( = h is a Lagrange distribution. 
□ 

It is clearly seeing in coordinates how Lagrange Grassmanian is sitting 
in the usual one. Let £ = W l * x R n = {(rj,y) : r] G R n *,y G W 1 }. Then 
any v G ({0} x R n ) has a form v = {(y T , Sy) : y G IR n }, where S is an 
n x n-matrix. It is easy to see that v is a Lagrangian subspace if and only if 
S is a symmetric matrix, S = S T . 



18 Monotonicity 

We continue to study curves in the Lagrange Grassmannian L(T Z N), in par- 
ticular, the Jacobi curves t i— > T et g^E e ts^ z y In Section 6 we identi- 
fied the velocity A(t) of any smooth curve A(-) in L(T Z N) with a quadratic 
form A(t) on the subspace A(£) C T Z N . Recall that the curve A(-) was 
called monotone increasing if A(t) > 0, Vi; it is called monotone decreasing 
if A(t) < 0. It is called monotone in both cases. 

Proposition II. 8 Set A(t) = (e^^j T et g^E et g^y then quadratic form 
A(t) is equivalent (up to a linear change of variables) to the form 



,^-(,o,H)(e tH (z)), (21) 



on E t fi 



W 

Proof. Let z t = e tH (z) ) then 

- A 

e~ eIi T E 



dt [) ~dt* L ^~ e * d£ 



£=0 



Set A(e) = e~ sH T Zt+F E Zt+e G L(T Zt N). It is enough to prove that A(0) is 
equivalent to form (21). Indeed, A(t) = e { ^ t)H T Zt A{0), where 

e (to-t)H . TztN ^ Tzt N 







is a symplectic isomorphism. The association of the quadratic form A(t) 
on the subspace A(t) to the tangent vector A(t) G L {T ZtQ N^ is intrinsic, 
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i.e. depends only on the symplectic structure on T ZtQ N. Hence A(0)(£) = 

A(t) (el <tt - t ^) ) veeA(o)=r^. 

What remains, is to compute A(0); we do it in the Darboux-Weinstein 
coordinates z = (x,y). We have: A(e) = 



fr) = ^)S-y^-r)+V(rV0(z t - T ), £(0) = £ G I 

V(r) = -0(*-r)e(r) T - 0- x (z t - T )r)(T), V (0) = G 



A(0)(0 = a ((£,0), (e(0),7)(0))) = £)(0) = -£0(*)£ T . 

Recall now that form (21) has matrix ^-(z t ) in the Darboux-Weinstein 
coordinates. □ 

This proposition clearly demonstrates the importance of monotone curves. 
Indeed, monotonicity of Jacobi curves is equivalent to the convexity (or con- 
cavity) of the Hamiltonian on each leaf of the Lagrange foliation. In the case 
of a cotangent bundle this means the convexity or concavity of the Hamil- 
tonian with respect to the impulses. All Hamiltonians (energy functions) 
of mechanical systems are like that! This is not an occasional fact but a 
corollary of the list action principle. Indeed, trajectories of the mechanical 
Hamiltonian system are extremals of the least action principle and the en- 
ergy function itself is the Hamiltonian of the correspondent regular optimal 
control problem as it was considered in Section 7. Moreover, it was stated 
in Section 7 that convexity of the Hamiltonian with respect to the impulses 
is necessary for the extremals to have finite Morse index. It turns out that 
the relation between finiteness of the Morse index and monotonicity of the 
Jacobi curve has a fundamental nature. A similar property is valid for any, 
not necessary regular, extremal of a finite Morse index. Of course, to for- 
mulate this property we have first to explain what are Jacobi curve for non 
regular extremals. To do that, we come back to the very beginning; indeed, 
Jacobi curves appeared first as the result of calculation of the ^-derivative 
at the regular extremal (see Sections 7, 8). On the other hand, ^-derivative 
is well-defined for any extremal of the finite Morse index as it follows from 
Theorem 1.1. We thus come to the following construction in which we use 
notations and definitions of Sections 3, 4. 

Let h(X, u) be the Hamiltonian of a smooth optimal control system, 
At, to < t < ti, an extremal, and q(t) = 7r(A t ), to,<t< t\ the extremal path. 
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Recall that the pair (X to ,X t ) is a Lagrangian multiplier for the conditional 
minimum problem defined on an open subset of the space 



M x L 0O ( [t , t x ],U) = {(q t ,u(-)) : q G M,u{-) G L^Kh^U)}, 



where u(-) is control and qt is the value at t of the solution to the differential 
equation q = f(q,u(r)), r G [to, ti]. In particular, F t (q t ,u(-)) = qt- The cost 
is j£(g t ,w(-)) and constraints are F to (q t ,u(-)) = q(0), q t = q(t). 

Let us set Jt(u) = J t * o (g(t), «(•)), = F to (q(t), «(•)). A covector 

A G T*M is a Lagrange multiplier for the problem (J t , $ t ) if and only if 
there exists an extremal A T , t Q < r < t, such that X to = A, A t G T*^M. In 
particular, A to is a Lagrange multiplier for the problem ( J t , $ t ) associated to 
the control «(•) = u(X_). 

Assume that indHess n ^J tl J ^ i C«C*q)) ) — °°> *° — ^ — ^ an< ^ se ^ ^* = 

( J t , $t). The curve 



discontinues, but it is monotone decreasing in a sense we are going to briefly 
describe now. You can find more details in [2] (just keep in mind that sim- 
ilar quantities may have opposite signs in different papers; sign agreements 
vary from paper to paper that is usual for symplectic geometry). Mono- 
tone curves in the Lagrange Grassmannian have analytic properties similar 
to scalar monotone functions: no more than a countable set of discontinuity 
points, right and left limits at every point, and differentiability almost every- 
where with semi-definite derivatives (nonnegative for monotone increasing 
curves and nonpositive for decreasing ones). True reason for such a mono- 
tonicity is a natural monotonicity of the family $ 4 . Indeed, let r < t, then 
$ r is, in fact, the restriction of $j to certain subspace: l T = i ( o p r , where 



discontinues) monotone curve in the Lagrange Grassmannian and the rela- 
tion between the Morse and Maslov index indices from Theorem 1.3 remains 
true. 

In fact, Maslov index is a key tool in the whole construction. The starting 
point is the notion of a simple curve. A smooth curve A(r), ro < r < T\, in 





S < T 



S > T 



. One can define the Maslov index of a (maybe 
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the Lagrange Grassmannian L(Ti) is called simple if there exists A G L(E) 
such that A n A(r) = 0, Vr G [to,Ti]; in other words, the entire curve 
is contained in one coordinate chart. It is not hard to show that any two 
points of L(E) can be connected by a simple monotone increasing (as well 
as monotone decreasing) curve. An important fact is that the Maslov index 
/x(A n (-)) °f a simple monotone increasing curve A(r), r < r < T\ is uniquely 
determined by the triple (n, A(ro), A(ti)); i.e. it has the same value for all 
simple monotone increasing curves connecting A(ro) with A(ri). A simple 
way to see this is to find an intrinsic algebraic expression for the Maslov index 
preliminary computed for some simple monotone curve in some coordinates. 
We can use Lemma 1.2 for this computation since the curve is simple. The 
monotonic increase of the curve implies that Sa^) > S A ^. 

Exercise. Let So, S\ be nondegenerate symmetric matrices and Si > So- 
Then indSo — indSi = ind (S^ 1 — S^ 1 ) . 

Let x G (A(r ) + A(ti) fill so that x = x + Xi, where Xi G A (7$), i — 0,1. 
We set q(x) = a(x 1 ,x ). If A(r ) n A(ri) = 0, then A(r ) + A(ti) = S, x is 
any element of II and xo,xi are uniquely determined by x. This is not true 
if A(to) H A(ri) 7^ but q(x) is well-defined anyway: a{xi,X2) depends only 
on Xo + X\ since cr vanishes on A(rj), i = 0, 1. 

Now we compute q in coordinates. Recall that 

Afr) = {(y T , 5 A(Ti) y) : yW 1 }, i = 0, 1, U = {y T , 0) : y G M n }. 
We have 

1^) = ViSa^Vo ~ VoSa^Vu 
where x = (y£ + y7> °)> ^A(ro)?/o + 5 A(Tl) yi = 0. Hence y 1 = -S A l taUi) S A{n)) y 
and 

q(x) = -y^S A{To) y - (S A{ro) y ) T S^ Ti) S A(n)) y = y T (s^ - S A ^ Tl) ) V, 

where y = S A ( T0 )y . We see that the form q is equivalent, up to a linear change 
of coordinates, to the quadratic form defined by the matrix S^}, — S^}y 
Now we set 

ind n (A(r ),A(ri)) ^indq. 
The above exercise and Lemma 1.2 imply the following: 

Lemma II. 8 If A(t), t < r < T\, is a simple monotone increasing curve, 
then 

MA(-)) = indn(A(r ),A(r 1 )). 
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Note that definition of the form q does not require transversality of A(-Tj) 
to IT. It is convenient to extend definition of indn(A(r ), A(ri)) to this case. 
General definition is as follows: 

indn(A , Ai) = ind q + i(dim(II n A ) + dim(II n A x )) - dim(II n A n Ai). 

The Maslov index also has appropriate extension (see Sec. 4]) and Lemma 
II. 8 remains true. 

Index indn(A ,Ai) satisfies the triangle inequality: 

indn(A ,A 2 ) < ind n (A , Ai) + ind n (Ai, A 2 ). 

Indeed, the right-hand side of the inequality is equal to the Maslov index of a 
monotone increasing curve connecting Ao with A 2 , i.e. of the concatenation 
of two simple monotone increasing curves. Obviously, the Maslov index of 
a simple monotone increasing curve is not greater than the Maslov index of 
any other monotone increasing curve connecting the same endpoints. 

The constructed index gives a nice presentation of the Maslov index of 
any (not necessary simple) monotone increasing curve A(t), t < t < t\\ 

i 

MAO)) = ^ind n (A(r,),A(r m )), (22) 

i=0 

where to = r o < r i < • • ■ < T i < T i+i = h and A , , are simple pieces of 

the curve A(-). If the pieces are not simple, then the right-hand side of (22) 

gives a low bound for the Maslov index (due to the triangle inequality). 

Let now A(t), to < t < t%, be a smooth curve which is not monotone 

increasing. Take any subdivision t = t < T\ < ■ • • < t\ < ti + i = t\ and 
i 

compute the sum ^2 indn(A(rj), A(r i+1 )). This sum inevitably goes to infinity 

when the subdivision becomes finer and finer. The reason is as follows: 
indn(A(rA A(rj + i)) > for any simple piece A| r . such that Mr) ^ 

0, Vr E [ Tj,Tj +1 ] and /^n (A L , = 0. I advise reader to play with the 

one-dimensional case of the curve in L(IR 2 ) = S 1 to see better what's going 
on. 

This should now be clear how to manage in the general nonsmooth case. 
Take a curve A(-) (an arbitrary mapping from [to,^i] into L(S)). For any 
finite subset T + {ri, . . . , r^} C [to, h], where t = r < T\ < ■ ■ ■ < T\ < 
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n+i = t\, we compute the sum 7^ = S m dn(A(rj), A(r i+ i)) and then find 

2=0 

supremum of these sums for all finite subsets: 7n(A(-)) = sup/n- The curve 



r 



A(-) is called monotone increasing if 1^ < oo; it is not hard to show that the 
last property does not depend on II and that monotone increased curves enjoy 
listed above analytic properties. A curve A(-) is called monotone decreasing 
if inversion of the parameter t i— > to + ti — t makes it monotone increasing. 

We set /i(A(-)) = Jn(A(-)) for any monotone increasing curve and 
/i(A(-)) = — Jn(A(-)) for a monotone decreasing one, where A(t) = A(t + 
ti — t). The defined in this way Maslov index of a discontinues monotone 
curve equals the Maslov index of the continues curve obtained by gluing all 
discontinuities with simple monotone curves of the same direction of mono- 
tonicity. 

If A(t) = C^Xt ,u)(^t) is the Jacobi curve associated to the extremal with 
a finite Morse index, then A(-) is monotone decreasing and its Maslov index 

computes indHess^ (jtil^-i^^^) m t ne wa Y similar to Theorem 1.3. Of 

course, these nice things have some value only if we can effectively find Jacobi 
curves for singular extremals: their definition was too abstract. Fortunately, 
this is not so hard; see [3] for the explicit expression of Jacobi curves for 
a wide class of singular extremals and, in particular, for singular curves of 
rank 2 vector distributions (these last Jacobi curves have found important 
applications in the geometry of distributions, see [TH ITi]). 

One more important property of monotonic curves is as follows. 

Lemma II. 9 Assume that A(-) is monotone and right- continues at t , i.e. 
A(t ) = lim A(t). Then A(t ) fl A(t) = f] A(t) for any t sufficiently close 

*Vo t <T<t 

to (and greater than) to. 



Proof. We may assume that A(-) is monotone increasing. Take centered 
at A(to) local coordinates in the Lagrange Grassmannian; the coordinate 
presentation of A(t) is a symmetric matrix S^t), where SA(t ) — and t \— > 
y 1 S\(t)y is a monotone increasing scalar function \/y e IR n . In particular, 
ker S\(t) = A(t) fl A(t ) is a monotone decreasing family of subspaces. □ 
We set T t = f] A(r), a monotone decreasing family of isotropic sub- 

t <T<t 

spaces. Let T = ma.xT t , then T t == T for all t > to sufficiently close to to- 

t>t 

We have: A(t) = A(t) z and A(t) D T for all t > t close enough to t ; hence 
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Tf D A(t). In particular, A(t) can be treated as a Lagragian subspace of the 
symplectic space r z /T. Moreover, Lemma II. 9 implies that A(t) H A(t ) = T. 
In other words, A(t) is transversal to A(to) in T z /T. In the case of a real- 
analytic monotone curve A(-) this automatically implies that A(-) is an ample 
curve in T z /T. Hence any nonconstant monotone analytic curve is reduced 
to an ample monotone curve. It becoms ample after the factorization by a 
fixed (motionless) subspace. 

19 Comparizon theorem 

We come back to smooth regular curves after the deviation devoted to a more 
general perspective. 

Lemma 11.10 Let A(t), t e [to?^i] be a regular monotone increasing curve 
in the Lagrange Grassmannian L(E). Then {t e [to? ^i] : A(£) fl II ^ 0} is a 
finite subset of [to,ti] VII e L(H). If to and t\ are out of this subset, then 

A*n(A(-))= dim(A(t)nn). 

te(to,*i) 

Proof. We have to proof that A(t) may have a nontrivial intersection with II 
only for isolated values oft; the rest is Lemma 1.1. Assume that A(t) nil ^ 
and take a centered at II coordinate neighborhood in L(E) which contains 
A(t). In these coordinates, A(r) is presented by a symmetric matrix S\(t) 
for any r sufficiently close to t and A(r) n II = kerS , A ( T ). Monotonicity and 
regularity properties are equivalent to the inequality Sa(t) > 0. In particular, 
y T SA(t)U > My G kerSA(t) \ {0}. The last inequality implies that Sa( t ) is a 
nondegenerate for all r sufficiently close and not equal to t. 

Definition. Parameter values r , T\ are called conjugate for the continues 
curve A(-) in the Lagrange Grassmannian if A(tq) n A(ri) ^ 0; the dimension 
of A(to) n A(ri) is the multiplicity of the conjugate parameters. 

If A(-) is a regular monotone increasing curve, then, according to Lemma 
II. 9, conjugate points are isolated and the Maslov index /iA(t ) {^\[t ti]) e( l ua l s 
the sum of multiplicities of the conjugate to to parameter values located 
in (t,ti). If A(-) is the Jacobi curve of an extremal of an optimal control 
problem, then this Maslov index equals the Morse index of the extremal; 
this is why conjugate points are so important. 
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Given a regular monotone curve A(-), the quadratic form A(t) defines 
an Euclidean structure (•, - )A(t) 011 MO so that A(£)( a; ) — ( x i x )\(t)- Let 
R\(t) G gl(A(£)) be the curvature operator of the curve A(-); we define the 
curvature quadratic form r\(t) on A(£) by the formula: 

r A (t)(x) = (R A (t)x,x) k(t) , x G A(t). 

Proposition II. 9 T/ie curvature operator R\(t) is a self-adjoint operator 
for the Euclidean structure (v)A(t)- ^ e / orm r A(0 equivalent (up to 
linear changes of variables) to the form A (£), where A°(-) £/ie derivative 
curve. 

Proof. The statement is intrinsic and we may check it in any coordinates. 
Fix t and take Darboux coordinates {(r),y) : 77 G IR n *,?/ G M n } in £ in 
such a way that A(t) = {(y T ,0) : y G M n }, A°(t) = {(0,y) : y G R"}, 
A(£)(y) = y T y. Let A(r) = {(y T ,S r y) : y G M n }, then 5 t = 0. Moreover, 
S(t) is the matrix of the form A(t) in given coordinates, hence S t = I- Recall 
that A°(r) = {(y T A T ,y + S T A T y) : y G W 1 }, where A T = -^S^SrS' 1 (see 
(5)). Hence S t = 0. We have: R A (t) = \St, r A (t)(y) = \y T St y, 

A°(t)(y) = a ((0,s/), (t/ T i t ,0)) = -y T i^ = ^y T S t y. 

So r A (t) and A°(t) have equal matrices for our choice of coordinates in A(t) 
and A°(t). The curvature operator is self-adjoint since it is presented by 
a symmetric matrix in coordinates where form A(t) is the standard inner 
product. □ 

Proposition II. 9 implies that the curvature operators of regular monotone 
curves in the Lagrange Grassmannian are diagonalizable and have only real 
eigenvalues. 

Theorem II. 1 Let A(-) be a regular monotone curve in the Lagrange Grass- 
mannian L(E), where dimS = 2n. 

• If all eigenvalues of R\(t) do not exceed a constant c > for any t 
from the domain of A(-), then \t\ — r \ > for any pair of conjugate 
parameter values t ,Ti. In particular, If all eigenvalues of R\{t) are 
nonpositive \/t, then A(-) does not possess conjugate parameter values. 
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• If tr R\(t) > nc for some constant c > and Wt, then, for arbitrary 
r < t, the segment [t, t + ^] contains a conjugate to r parameter 
value as soon as this segment is contained in the domain of A(-). 

Both estimates are sharp. 

Proof. We may assume without lack of generality that A(-) is ample mono- 
tone increasing. We start with the case of nonpositive eigenvalues of R\(t). 
The absence of conjugate points follows from Proposition II. 9 and the fol- 
lowing 

Lemma 11.11 Assume that A(-) is an ample monotone increasing (decreas- 
ing) curve and A°(-) is a continues monotone decreasing (increasing) curve. 
Then A(-) does not possess conjugate parameter values and there exists a 
lim A(t) = Aoo. 

Proof. Take some value of the parameter r ; then A(r ) and A°(r ) is a 
pair of transversal Lagrangian subspaces. We may choose coordinates in the 
Lagrange Grassmannian in such a way that S\( T )) = an d Sa°(to) = i- e - 
A(to) is represented by zero n x n-matrix and A°(r ) by the unit matrix. 
Monotonicity assumption implies that t h- > S\(t) is a monotone increasing 
curve in the space of symmetric matrices and t h- > S'Ao(t) is a monotone 
decreasing curve. Moreover, transversality of A(£) and A°(£) implies that 
S\°(t) — S\(t) is a nondegenerate matrix. Hence < < < I for any 

t > t . In particular, A(t) never leaves the coordinate neighborhood under 
consideration for T > r , the subspace A(t) is always transversal to A(r ) 
and has a limit Aoo, where = sup 5^). □ 

t>TQ 

Now assume that the eigenvalues of R\(t) do not exceed a constant c > 0. 
We are going to reparametrize the the curve A(-) and to use the chain rule 
(7). Take some t in the domain of A(-) and set 

(p(t) = -j= (arctan(v^t) + |) + i, A^t) = A(ip(t)). 

We have: <p(R) = (t,t+^j, tp{t) = J^, R v (t) = -p^. Hence, 
according to the chain rule (7), the operator 

R ^= {c / +1 y (RMt))-d) 
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has only nonpositive eigenvalues. Already proved part of the theorem implies 
that Ay, does not possess conjugate values of the parameter. In other words, 
any length ^ interval in the domain of A(-) is free of conjugate pairs of the 
parameter values. 

Assume now that tr R\(t) > nc. We will prove that the existence of 
A G L(E) such that AnA(i) = for all t E [t, r] implies that r-t < ^. We'll 

prove it by contradiction. If there exists such a A, then A|j f ^ is completely 
contained in a fixed coordinate neighborhood of L(T,), therefore the curvature 
operator R\(t) is defined by the formula (6). Put B(t) = (2S t )~~ 1 S t , b(t) = 
trB(t), t e [t,r]. Then 

B(t) = B 2 (t) + R A (t), b(t) = tr B 2 (t) + tvR A (t). 

Since for an arbitrary symmetric n x n-matrix A we have trA 2 > ^(trA) 2 , 
the inequality b > ^ + nc holds. Hence b(t) > /3(t), t <t < r, where (3{-) is 
a solution of the equation (3 = ^ + nc, i.e. (3(t) = ny / ctan( v /c(t — to)). The 
function b(-) together with /?(•) are bounded on the segment [i, r]. Hence 
T-t< 

— Vc 

To verify that the estimates are sharp, it is enough to consider regular 
monotone curves of constant curvature. □ 



20 Reduction 

We consider a Hamiltonian system on a symplectic manifold N endowed with 
a fixed Lagrange foliation E. Assume that g : N — > R is a first integral of 
our Hamiltonian system, i.e. {/i, = 0. 

Lemma 11.12 Let z <E N, g(z) = c. The leaf E z is transversal to 5'~ 1 (c) at 
z if and only if g(z) T Z E Z . 

Proof. Hypersurface 5 f_1 (c) is not transversal to (? _1 (c) at z if and only if 

d z g{T z E z ) = a(g(z),T z E z ) = & g{z) e (T Z E Z ) Z = T Z E Z . D 

If all points of some level g~ l {c) satisfy conditions of Lemma 11.12, then 
g _1 (c) is a (2n-l)-dimensional manifold foliated by (n — l)-dimensional sub- 
manifolds E z fl (? _1 (c). Note that M.g(z) = ker<r| r hence 
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E| d = T z g 1 (c)/Wg(z) is a 2(n — l)-dimensional symplectic space and 



Af T 2 (E z fl <7 -1 (c)) is a Lagrangian subspace in L 9 Z , i.e. Af G L(E 

The submanifold g _1 (c) is invariant for the flow e th . Moreover, e^g = 
g. Hence e* ft induces a symplectic transformation ef 1 : Ef — > E^ tS . Set 

J|(£) = e~ th A 9 t ^^ . The curve £ t— > in the Lagrange Grassmannian 

L(H 9 Z ) is called a reduced Jacobi curve for the Hamiltonian field h at z G iV. 

The reduced Jacobi curve can be easily reconstructed from the Jacobi 
curve J z (t) = e~ th tS(z) £ etS(z) J G L(T Z N) and vector g(z). An elementary 
calculation shows that 

J*(t) = J z (t)f]g(z) z + Rg(z). 

Now we can temporary forget the symplectic manifold and Hamiltonians and 
formulate everything in terms of the curves in the Lagrange Grassmannian. 
So let A(-) be a smooth curve in the Lagrange Grassmannian L(E) and 7 a 
one-dimensional subspace in E. We set A 7 (t) = A(t) fl 7 Z + 7, a Lagrange 
subspace in the symplectic space 7 Z/ 7. If 7 ^ A(t), then A 7 (-) is smooth and 
A 7 (t) = A.(t)\ A ^ ny z as ^ easily follows from the definitions. In particular, 
monotonicity of A(-) implies monotonicity of A 7 (-); if A(-) is regular and 
monotone, then A 7 (-) is also regular and monotone. The curvatures and the 
Maslov indices of A(-) and A 7 (-) are related in a more complicated way. The 
following result is proved in 0. 

Theorem II. 2 Let A(t), t G [to? ^1] ^ e a smooth monotone increasing curve 
in L(E) and 7 a one- dimensional subspace of E such that 7 (jL A(£), Vt G 
[*o,*i]. Let LT G L(E), 7 G: II, A(t )nn = A(ti)nn = 0. Then 

. /in(A(-))<^(A 7 (-))</^n(A(-)) + l- 
• //A(-) zs regular, then r\i(t) > r A (t) | A ™ n ^ an <^ 



rank (r A7 (f) - r A (0| A(t)rV ) ^ L 



The inequality r^(t) > r A (t) | A / t ) n ^ turns into the equality if 

7 G A°(t), Vt. Then 7 G kerA (t). According to Proposition II. 9, to 7 
there corresponds a one-dimensional subspace in the kernel of r A (t); in par- 
ticular, r A (t) is degenerate. 
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Return to the Jacobi curves J z (t) of a Hamiltonian field h. There al- 
ways exists at least one first integral: the Hamiltonian h itself. In general, 
h(z) ^ Jg(Q) and the reduction procedure has a nontrivial influence on the 
curvature (see [E1IH] for explicit expressions). Still, there is an important class 
of Hamiltonians and Lagrange foliations for which the relation h(z) G J°(0) 
holds \/z. These are homogeneous on fibers Hamiltonians on cotangent bun- 
dles. In this case the generating homotheties of the fibers Euler vector field 
belongs to the kernel of the curvature form. 

21 Hyperbolicity 

Definition. We say that a Hamiltonian function h on the symplectic man- 
ifold iV is regular with respect to the Lagrange foliation E if the functions 
h\ E have nondegenerate second derivatives at z, Vz G iV (second derivative 
is well-defined due to the canonical afline structure on E z ). We say that h is 
monotone with respect to E if h\„ is a convex or concave function \fz G N. 

Typical examples of regular monotone Hamiltonians on the cotangent 
bundles are energy functions of natural mechanical systems. Such a func- 
tion is the sum of the kinetic energy whose Hamiltonian system generates 
the Riemannian geodesic flow and a "potential" that is a constant on the 
fibers function. Proposition II. 8 implies that Jacobi curves associated to the 
regular monotone Hamiltonians are also regular and monotone. We'll show 
that negativity of the curvature operators of such a Hamiltonian implies the 
hyperbolic behavior of the Hamiltonian flow. This is a natural extension of 
the classical result about Riemannian geodesic flows. 

Main tool is the structural equation derived in Section 13. First we'll 
show that this equation is well coordinated with the symplectic structure. 
Let A(t), t G R, be a regular curve in L(E) and £ = A(t) A°(t) the 
correspondent canonical splitting. Consider the structural equation 

e(t) + R A {t)e{t) = 0, where e{t) G A(t), e{t) G A°(t), (23) 

(see Corollary II. 1). 

Lemma 11.13 The mapping e(0) © e(0) i— > e(t) © e(t), where e(-) and e(-) 
satisfies (23), is a symplectic transformation ofH. 
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Proof. We have to check that a [e x (£), e 2 (t)), a(£i(t), e 2 (t)), a(e 1 (t), e 2 (t)) 
do not depend on t as soon as ej(i),ej(i), % — 1,2, satisfy (23). First two 
quantities vanish since A(t) and A°(£) are Lagrangian subspaces. The deriva- 
tive of the third quantity vanishes as well since e\(t) G A(i). □ 

Let /i be a regular monotone Hamiltonian on the symplectic manifold 
A" equipped with a Lagrange foliation E. As before, we denote by J z (t) the 
Jacobi curves of h and by (t) the reduced to the level of h Jacobi curves (see 
previous Section). Let R(z) = Rj z (0) and R h (z) = Rjh(0) be the curvature 
operators of J z {) and J z (•) correspondently. We say that the Hamiltonian 
field h has a negative curvature at z with respect to E if all eigenvalues of 
R(z) are negative. We say that h has a negative reduced curvature at z if all 
eigenvalues of R h z are negative. 

Proposition 11.10 Let z G N, z t = e th (z). Assume that that {z t : t G K} zs 
a compact subset of N and that N is endowed with a Riemannian structure. 
If h has a negative curvature at any z G {z t : t G R} ; then there exists a 
constant a > and a splitting T zt N = A+ © A~ ; where Af t are Lagrangian 
subspaces ofT Zt N such that ef(Af t ) = Af t+r W, rGl and 

||e^C±ll >e QT ||C±ll V(GA±,r>0,tGl. (24) 

Similarly, ifh has a negative reduced curvature at any z G {z t : t G M} 7 t/ien 
t/iere exists a splitting T Zt {h~ x (c) /M.h(zt)) = A+ © A~ 7 where c = h(z ) and 
Af t are Lagrangian subspaces of T Zt (h~ 1 (c)/'Rh(zt)) such that el h (Af t ) = 
Af t+T Vt.rGl and \\ef rfl (±\\ > e aT \\(±\\ V( G Aj, r > 0, t G R. 

Proof. Obviously, the desired properties of A^ and Af t do not depend on 
the choice of the Riemannian structure on N. We'll introduce a special Rie- 
mannian structure determined by h. The Riemannian structure is a smooth 
family of inner products (•, -) z on T Z N, z G N. We have T Z A^ = J z (0) © J°(0), 
where J a (0) = T^i? z . Replacing /i with —h if necessary we may assume that 
h\ E is a strongly convex function. First we define (•, -) 2 | J to be equal to 

the second derivative of h\ E . Symplectic form a induces a nondegenerate 
pairing of J z (0) and J°(0). In particular, for any ( G J 2 (0) there exists a 
unique C° G J°(0) such that cr((°, -)| 7 = (£, J ^. There exists a unique 
extension of the inner product (•, -) z from J 2 (0) to the whole T^A^ with the 
following properties: 
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• J°(0) is orthogonal to J z (0) with respect to (•, -) z ; 

• (Ci,C2), = (Ci o ,C 2 o ),vCi,C 2 g^(0). 

We'll need the following classical fact from Hyperbolic Dynamics (see, for 
instance, O Sec. 17.6]). 

Lemma 11.14 Let A(t), t £ R, be a bounded family of symmetric n x n- 
matrices whose eigenvalues are all negative and uniformly separated from 0. 
Let T(t,r) be the fundamental matrix of the In- dimensional linear system 
x = —y, y = A(t)x, where x, y G M n ; i.e. 

^(t,r) = (° A - I )r(t,r), r(r,T) = (£?). (25) 

Then there exist closed conic neighborhoods Cpt, C^ , where Cp n Cp = 0, of 
some n-dimensional subspaces ofM? n and a constant a > such that 

T(t,r)C+ C C+, \T(t,r)C+\ > e^K+l, V£ + G C+, t < r, 

and 

r(t,r)Cf C Cp, |r(t,r)M > e a ^|£_|, V£_ G C r , t > r. 

The constant a depends only on upper and lower bounds of the eigenvalues 
of A{t). □ 

Corollary II.6 Let Cp be the cones described in Lemma 11.14; then 
r(0,±t)Cf C r(0;±r)Cj£ for any t > r > and the subsets 
Kp = p) r(0,t)Cp are Lagrangian subspaces ofM. n x R n equipped with the 

standard symplectic structure. 

Proof. The relations r(r,f)C+ c C+ and r(r,i)C r C Cf imply: 

r(o, ±t)c£ = r(o, ±r)r(±r, ±t)c£ c r(o, ±r)cp. 

In what follows we'll study Kp ; the same arguments work for i^p . Take 
vectors (,(' e ^ri then C = r(0,t)C* and C' = r(0,t)C t ' for any t > and 
some CtX't e ^r- Then, according to Lemma 11.14, |£ t | < e~ Q *|£|, \Q\ < 
e~ at \('\, i.e. an d Ct tend to as t — > +oo. On the other hand, 

(7(6 CO = *(r(o, t)c 4 , r(o, t)C) = a(Ct, CD vt > o 



66 



since T(0,t) is a symplectic matrix. Hence cr(C, CO = 0. 

We have shown that Xp is an isotropic subset of R n x R n . On the other 
hand, JTp contains an n-dimensional subspace since Cp~ contains one and 
r(0, i) are invertible linear transformations. Isotropic n-dimensional sub- 
space is equal to its skew-orthogonal complement, therefore is a La- 
grangian subspace. □ 

Take now a regular monotone curve A(t), t G R in the Lagrange Grass- 
mannian !/(£). We may assume that A(-) is monotone increasing, i.e. A(t) > 
0. Recall that A(t)(e(t)) = cr(e(t),e(t)), where e(-) is an arbitrary smooth 
curve in E such that e(r) G A(t), Vt. Differentiation of the identity 
o-(ei(r),e 2 (r)) = implies: tr(ei(t), e 2 (t)) = -<r(ei(t), e 2 (t)) = 0"(e 2 (O> ei(f)) 
if ej(r) G A(r), Vr, i — 1,2. Hence the Euclidean structure (•, -)A(t) defined 
by the quadratic form A(t) reads: (ei(t), e2(t))hu) = a ( e i(t)^2(t))- 

Take a basis ei(0), . . . , e n (0) of A(0) such that the form A(t) has the 
unit matrix in this basis, i.e. <r(ej(0), e^O)) = 5ij. In fact, vectors ej(0) 
are defined modulo A(0); we can normalize them assuming that ej(0) G 
A°(0), % = 1, . . . ,n. Then ei(0), . . . , e„(0), ei(0), . . . , e n (0) is a Darboux basis 
of S. Fix coordinates in £ using this basis: £ = R n x R Tt , where (y) G 

n 

R n x R n is identified with £ (^(0) + y j ej(0)) G S, a: = (a; 1 , . . . , x n ) T , 
1/ = (y\---,2/ n ) T - 

We claim that there exists a smooth family A(t), £ G R, of symmetric 
n x n matrices such that A(t) has the same eigenvalues as R^it) and 

A(t) = r(0,t)( K "), A°(t) = r(0,t)( R °„), Vt G R 

in the fixed coordinates, where r(t,r) satisfies (25). Indeed, let ej(t), 2 = 
1, . . . ,n, be solutions to the structural equations (23). Then 

A(t) = span{e!(t), e n (t)}, A°(t) = span{ei(t), e n (t)}. 

n 

Moreover, e^t) = — a ij(t)ej(t), where A(t) = (t)}™ j=1 is the matrix of 
i=i 

the operator R\(t) in the 'moving' basis ei(t), . . . , e n (t). Lemma 1.13 implies 
that (ei(t),ej(t))fa t \ = cr(ei(t),ej(t)) = 5ij. In other words, the Euclidean 
structure (•, - )A(t) nas un ^ m &trix in the basis ei(t), . . . , e n (t). Operator R\(t) 
is self-adjoint for the Euclidean structure (•, •)\^ (see Propositon II. 9). Hence 
matrix A(t) is symmetric. 
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Let ei(t) = e M n x R n in the fixed coordinates. Make upnxn- 

matrices X(t) = (xi(t), . . . , x„(t)), Y(t) = (yi(t), . . . ,y n (t)) and a 2n x 2n- 



matrix 



X(t) X(t) 



Y(t) Y(t) 

d fX X 



dt \Y Y 



it) 



. We have 



A v\ .. /() -A(t)\ fX X 



Y Y 



(t) 



J ' 



Y Y 



(0) 



/ 
/ 



Hence (** ) (t) = T(t, 0)" 1 = T(0, t). 

Let now A(-) be the Jacobi curve, A(t) = J Zo (t). Set ii(z t ) = e'^e^t), 
Vi(zt) = e{ h ei(t); then 

£l(zt), £n(zt),Vl(zt), ■■■,Vn(zt) (26) 

is a Darboux basis of T Zt N, where J Zt (0) = span{^i(z t ) , . . . , Cn(zt)}, J° t (^) = 
span{r)i(z t ) , . . . , r) n (z t )} . Moreover, the basis (26) is orthonormal for the 
inner product (•, -) Zt on T Zt N. 

The intrinsic nature of the structural equation implies the translation 
invariance of the construction of the frame (26): if we would start from z s 
instead of z and put A(t) = J Zs (t), e«(0) = £i(z s ), ej(0) = r)i(z s ) for some 
s e R, then we would obtain efe^t) = £i(z s+t ), efe^t) = r)i(z s+t ). 

The frame (26) gives us fixed orthonormal Darboux coordinates in T Zs N 
for Vs G K. and the correspondent symplectic 2n x 2n-matrices r Za (r,t). We 
have: r Zs (r,t) == T ZQ (s + r, s + t); indeed, r Zs (r, £) (^) is the coordinate 
presentation of the vector 

e (r-t)h ^j?( Za+t )+y%( Za+t ) 

in the basis ^j(z s+r ), r)i(z s+T ). In particular, 



|r*(<M)(5: 



J2^C(z s+ t) + y\0 



s+t) 



(27) 



Recall that £i(z T ), . . . , ^ti(^t), ^i(^t), • • • , Vn(z T ) is an orthonormal frame for 
the scalar product (•, -) Zt and ||C|L = a/ (C,C) Zt - 
We introduce the notation : 



J2xr(z s )+y\(z s ):( x y )eWy 
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for any W C W 1 x IR n . Let Cj? be the cones from Lemma 11.14. Then 

e; T \r z Mt)C±J Z3 _ T = [r Zs _ T (0,t + T)C± o \ Zs _ T , Vt,T,s. (28) 
Now set tf+ = fl C<r + V K~ = f) Cf and A± = ^J*. Corollary 

t>0 i<0 

II. 6 implies that A^ are Lagrangian subspaces of T Zg N '. Moreover, it follows 
from (28) that ef A J = A± +( , while (28) and (27) imply inequalities (24). 

This finishes the proof of the part of Proposition 11.10 which concerns 
Jacobi curves J z {t). We leave to the reader a simple adaptation of this proof 
to the case of reduced Jacobi curves J z {t). □ 

Remark. Constant a depends, of course, on the Riemannian structure on N. 
In the case of the special Riemannian structure defined at the beginning of the 
proof of Proposition 11.10 this constant depends only on the upper and lower 
bounds for the eigenvalues of the curvature operators and reduced curvature 
operators correspondently (see Lemma 11.14 and further arguments). 

Let e tx ,t G R be the flow generated by the the vector field X on a 
manifold M. Recall that a compact invariant subset W C M of the flow 
e tx is called a hyperbolic set if there exists a Riemannian structure in a 
neighborhood of W, a positive constant a, and a splitting T Z M = © 
E-®RX(z), zeW, such that X(z) ^ 0, ef Ef = Ef tX{z) , and ||ef ^11 > 
e^HC^I, Vt > 0, C ± e Ef. Just the fact some invariant set is hyperbolic 
implies a rather detailed information about asymptotic behavior of the flow 
in a neighborhood of this set (see ^2] for the introduction to Hyperbolic 
Dynamics). The flow e tx is called an Anosov flow if the entire manifold M 
is a hyperbolic set. 

The following result is an immediate corollary of Proposition 11.10 and 
the above remark. 

Theorem II. 3 Let h be a regular monotone Hamiltonian on N , c G R, W C 
h~ l (c) a compact invariant set of the flow e th , t G R ; and d z h ^ 0, G W . 
If h has a negative reduced curvature at every point of W , then W is a 
hyperbolic set of the flow e th \ h -i, c y D 

This theorem generalizes a classical result about geodesic flows on com- 
pact Riemannian manifolds with negative sectional curvatures. Indeed, if 
iV is the cotangent bundle of a Riemannian a Riemannian manifold and e th 
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is the geodesic flow, then negativity of the reduced curvature of h means 
simply negativity of the sectional Riemannian curvature. In this case, the 
Hamiltonian h is homogeneous on the fibers of the cotangent bundle and the 
restrictions eth \ h -if c \ are equivalent for all c > 0. 

The situation changes if h is the energy function of a general natural 
mechanical system on the Riemannian manifold. In this case, the flow and 
the reduced curvature depend on the energy level. Still, negativity of the 
sectional curvature implies negativity of the reduced curvature at h~ x (c) 
for all sufficiently big c. In particular, eth \ h -ir c \ is an Anosov flow for any 
sufficiently big c; see [8 , 9J for the explicit expression of the reduced curvature 
in this case. 

Theorem II. 3 concerns only the reduced curvature while the next result 
deals with the (not reduced) curvature of h. 

Theorem II. 4 Let h be a regular monotone Hamiltonian and W a compact 
invariant set of the flow e th . If h has a negative curvature at any point of 
W , then W is a finite set and each point of W is a hyperbolic equilibrium of 
the field h. 

Proof. Let z G W; the trajectory z t = e th (z), t£l, satisfies conditions of 
Proposition 11.10. Take the correspondent splitting T Zt N = A+ © A~. In 

particular, h(z t ) = h + (z t ) + h~(z t ), where K^fa) G Af. 

We have el h h(z t ) = h(z t+T ). Hence 

|fe +T )|| = \\efh(z t )\\ > \\efh+(z t )\\ - \\efh-(z t )\\ 

>e Q l/^)||-e- Q l/;-(^)||, Vr>0. 

Compactness of {z t : t G M} implies that h + (zt) is uniformly bounded; hence 
h+(z t ) = 0. Similarly, \\h{z t - T \\ > e aT \\h~{z t )\\ - e- aT \\h + (z t )\\ that implies 
the equality h~(z t ) = 0. Finally, h(zt) = 0. In other words, zt = z is an 
equilibrium of h and T Z N = A+©A~ is the splitting of T Z N into the repelling 
and attracting invariant subspaces for the linearization of the flow e th at z. 
Hence z is a hyperbolic equilibrium; in particular, z is an isolated equilibrium 
of h. □ 

We say that a subset of a finite dimensional manifold is bounded if it has 
a compact closure. 
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Corollary II. 7 Assume that h is a regular monotone Hamiltonian and h 
has everywhere negative curvature. Then any bounded semi-trajectory of the 
system z = h(z) converges to an equilibrium with the exponential rate while 
another semi-trajectory of the same trajectory must be unbounded. □ 

Typical Hamiltonians which satisfy conditions of Corollary II. 7 are en- 
ergy functions of natural mechanical systems in IR n with a strongly concave 
potential energy. Indeed, in this case, the second derivative of the poten- 
tial energy is equal to the matrix of the curvature operator in the standard 
Cartesian coordinates (see Sec. 15). 

References 

[1] A. A. Agrachev Topology of quadratic maps and Hessians of smooth 
maps. Itogi Nauki; Algebra, Topologiya, Geometriya, 1988, v.26, 85- 
124 (in Russian). English, transl.: J. Soviet Math., Plenum Publ. Corp., 
1990, 990-1013 

[2] A. A. Agrachev, R. V. Gamkrelidze, Symplectic geometry and necessary 
conditions for optimality. Matem. Sbornik, 1991, v. 182 (in Russian). 
English transl.: Math. USSR Sbornik, 1992, v. 72, 29-45 

[3] A. A. Agrachev, R. V. Gamkrelidze, Symplectic methods in optimization 
and control. In the book: Geometry of Feedback and Optimal Control. 
B. Jakubczyk, W. Respondek, Eds. Marcel Dekker, 1998, 19-77 

[4] A. A. Agrachev, R. V. Gamkrelidze, Feedback-invariant optimal control 
theory and differential geometry, I. Regular extremals. J. Dynamical and 
Control Systems, 1997, v.3, 343-389 

[5] A. A. Agrachev, Feedback-invariant optimal control theory and differ- 
ential geometry, II. Jacobi curves for singular extremals. J. Dynamical 
and Control Systems, 1998, v.4, 583-604 

[6] A. A. Agrachev, I. Zelenko, Geometry of Jacobi curves, I, II. J. Dynam- 
ical and Control Systems, 2002, v.8, 93-140; 167-215 

[7] A. A. Agrachev, Yu. L. Sachkov, Control theory from the geometric 
viewpoint. Springer Verlag, 2004, xiv+412pp. 



71 



[8] A. A. Agrachev, N. Chtcherbakova, Hamiltonian systems of negative 
curvature are hyperbolic. Russian Math. Dokl., 2005, v. 400 

[9] A. A. Agrachev, N. Chtcherbakova, I. Zelenko, On curvatures and focal 
points of dynamical Lagrangian distributions and their reductions by first 
integrals. J. Dynamical and Control Systems, to appear 

[10] V. I. Arnold, A. B. Givental, Symplectic geometry. Springer Verlag, En- 
cyclopedia of Mathematical Sciences, v.4, 1988, 1-136 

[11] B. Dubrov, I. Zelenko, Canonical frame for rank 2 distributions of max- 
imal class. C. R. Acad. Sci. Paris, submitted 

[12] A. Katok, B. Hasselblatt, Introduction to the modern theory of dynam- 
ical systems. Cambridge Univ. Press, 1998 

[13] J. W. Milnor, J. D. Stasheff, Characteristic classes. Princeton Univ. 
Press, 1974 

[14] I. Zelenko, Variational approach to differential invariants of rank 2 vec- 
tor distributions. J. Differential Geometry and Appl., to appear 



72 



